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Abstract 



We present a mathematical analysis of planar motion of energetic 
electrons moving through a planar dipole undulator, excited by a fixed 
planar polarized plane wave Maxwell field in the X-Ray FEL regime. 
Our starting point is the 6D Lorentz system, which allows planar 
motions, and we examine this dynamical system as the wave length 
A of the traveling wave varies. By scalings and transformations the 
6D system is reduced, without approximation, to a 2D system in a 
form for a rigorous asymptotic analysis using the Method of Aver- 
aging (MoA), a long time perturbation theory. The two dependent 
variables are a scaled energy deviation and a generalization of the so- 
called ponderomotive phase. As A varies the system passes through 
resonant and nonresonant (NR) zones and we develop NR and near- 
to-resonant (NtoR) MoA normal form approximations. The NtoR 
normal forms contain a parameter which measures the distance from 
a resonance. For a special initial condition, for the planar motion and 
on resonance, the NtoR normal form reduces to the well known FEL 
pendulum system. We then state and prove NR and NtoR first-order 
averaging theorems which give explicit error bounds for the normal 
form approximations. We prove the theorems in great detail, giving 
the interested reader a tutorial on mathematically rigorous perturba- 
tion theory in a context where the proofs are easily understood. The 
proofs are novel in that they do not use a near identity transformation 
and they use a system of differential inequalities. The NR case is an 
example of quasiperiodic averaging where the small divisor problem 
enters in the simplest possible way. To our knowledge the planar prob- 
lem has not been analyzed with the generality we aspire to here nor 
has the standard FEL pendulum system been derived with associated 
error bounds as we do here. We briefly discuss the low gain theory 
in light of our NtoR normal form. Our mathematical treatment of 
the noncollective FEL beam dynamics problem in the framework of 
dynamical systems theory sets the stage for our mathematical investi- 
gation of the collective high gain regime. 
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1 Introduction 



We present a normal form analysis of the three-degree-of-freedom Lorentz 
force system of six ODE's (ordinary differential equations) governing the 
planar (x, y — 0, z) motion of relativistic electrons moving through a planar 
dipole undulator along the z-axis perturbed by a traveling wave radiation 
field along the z direction. We are interested in the parameter range for an 
X-Ray FEL. 

Our normal form analysis is based on the Method of Averaging (MoA) 
at first order. The method has four steps. The first step is to put the 
ODE's into a standard form. The second step is to identify the normal form 
approximations. The third step is the derivation of error bounds relating 
the exact and normal form solutions. The final step is the transformation 
back to the original variables of the Lorentz force system. In the first step 
new variables are typically introduced using scalings and transformations. In 
this process we discover that the exact problem can be formulated, without 
approximation, in terms of two ODE's for the normalized energy deviation 
and a generalized ponderomotive phase. Important in this process is the 
identification of an appropriate small dimensionless parameter, often denoted 
by e, so that the system can be written as u = sf(u, t) + 0(e 2 ). In the present 
context this is the most complicated step. The normal form approximation is 
obtained by dropping the 0(e 2 ) term and replacing / by its t— average. The 
third step is often the most difficult, however here the system in standard 
form is fairly simple and we use this opportunity to give very detailed proofs 
of two averaging theorems, partly as a tutorial on the methods of proof, 
rather than applying general theorems from the literature. The latter allows 
us to obtain quite explicit error bounds which are likely near optimal. 

An electron, as a member of an electron bunch, will enter the undulator 
with a given angle in the y = plane and a given Lorentz factor. Here 
the normalized angle will be given by AP x q and the Lorentz factor will be 
written 7 = j c (l + rj) where 7 C is a characteristic value of 7 for the electron 
bunch, e.g. the mean, and rj is the so-called normalized energy deviation. 
We will replace 77 by x Ym t ne relation 77 = ex, where a posteriori e will be 
a measure of the spread of 77 values which lead to an FEL pendulum type 
behavior. We let B u , k u denote the undulator field strength and wave number 
and let E r ,uk r denote the Maxwell field strength and wave number of the 
fixed traveling wave radiation field. Thus our basic parameters are eight, 
namely AP x0 , j c , e, B u , k u , E r , k r , v. We will study the electron response to 
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*r := rT5 ■ ( L2 ) 



the radiation field as v = 0(1) varies. The choice of the parameter k r will 
be discussed below. 

For an X-Ray FEL, e is small, 7 C is large and the undulator parameter, 

eB 

K := = .93AX u [cm]B u [T] , (1.1) 

mck u 

is 0(1). Also k r = 0{k u ^1) and we define the 0(1) constant K r by 

k 

In §2.31 we will fix K T (and thus k r ) by setting 

K r = 2[1 + l -K 2 + K^AP^) 2 ]' 1 • (1-3) 

For those familiar with FEL theory, k r is, for AP x0 = 0, the usual so-called 
resonant wave number (See e.g., [JJ). The dependence of K r on AP x0 will 
be a consequence of our analysis. For the LCLS (Linac Coherent Light 
Source) X u = 3cm, mc 2 7 c = 15GeV and B u = 1.32T so that K = 3.70 (see 
|http://www-ssrl. slac . Stanford. edu/lcls/lcls_parms .html p . 

Mathematically then, we are interested in an asymptotic analysis of the 
electron motion for e small and 7 C large as v varies. In particular we are 
interested in the (e, j c ) regime that gives rise to the pendulum type behavior 
important for the functioning of an X-Ray FEL. We find that in order to 
obtain this behavior, in the MoA at first-order, there must be a relation 
between e and j c . Introducing the normalized field strength 

£ = k • (L4) 

we show a pendulum type behavior emerges when e = 0(a/£/7 c ) for 7 C ^> 1. 
Without loss of generality we will take the order constant to be 1, and choose 



(1.5) 

We also show that, for e small, the system associated with (jl.5p has a reso- 
nance structure, such that as v varies the system goes through a sequence of 
nonresonant (NR) and near-to-resonant (NtoR) zones. The associated NtoR 
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approximating normal forms are pendulum like and reduce to the standard 
FEL pendulum system for AP x0 = and v an odd integer. This behavior 
is not present for e <C 1/7 C or e ^> l/j c and so we refer to (II. 5p as a distin- 
guished case. This turns out to be a very simple example of the concept of 
a "distinguished limit" in the singular perturbation literature. This can be 
seen in action in the context of our equations (I2.56P and (I2.57p . 

In summary, for the distinguished case of (|1.5j) . our basic nondimensional 
parameters are K, AP x q, £, e, v. For e small we will obtain a sequence of non- 
resonant (NR) and near-to-resonant (NtoR) normal form approximations as 
v varies. The NtoR normal forms can be understood in terms of the sim- 
ple pendulum system and reduce to the usual FEL pendulum equations for 
AP x o = and v an odd integer (See Sections 13.4.21 and 13.4.31) . The NtoR 
normal form allows us to study the effect of v being slightly off resonance. 
This completes the first two steps in the MoA. In the third step we prove two 
theorems which give error bounds, relating the exact and normal form solu- 
tions, which go to zero as e — > 0+. Our goal is to present a mathematically 
rigorous analysis that is self contained. 

Standard derivations of the FEL pendulum equations can be found in 
[2] , [3] , [I] , [5] . They differ from our approach in that they start from the 
ODE for the normalized energy deviation, rj, and use physical reasoning to 
introduce approximations leading to the FEL pendulum normal form for 
AP x o = 0. In contrast, our starting point is the three-degree-of- freedom 
Lorentz force ODE's which are clearly more general and we make no approx- 
imation in going to the standard form for the MoA. Thus our only approx- 
imation is in going from the averaging standard form to the normal form 
approximations. Furthermore we obtain error bounds which do not appear 
to be possible in the standard derivations and these bounds are covered by 
our averaging theorems. Our definition of resonance is intimately linked to 
the derivation of our averaging normal forms, whereas in the standard deriva- 
tions resonance is introduced in the context of maximizing energy exchange. 
We emphasize that we obtain more than the pendulum normal form; we also 
obtain the more general NtoR normal form as well as the NR normal forms. 

We do not intend to minimize the importance of the standard derivations, 
the physical derivations are certainly important and as is often the case 
show great physical insight. Here we want to show what can be done in 
a mathematically rigorous way in the context of dynamical systems theory, 
but in that we have been guided by and are indebted to the work of e.g., 
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For ODE's, the MoA is the most robust of the longtime perturbation 
theories which include e.g., Lindstedt series [B], multiple scales [5J, renor- 
malization group methods [7] and Hamiltonian perturbation theory [8]. For 
example, Hamiltonian perturbation theory has the advantage that one is 
transforming a scalar function, however the MoA is more robust in that 
transformations and scalings are not restricted to canonical transformations. 
Central to the MoA, and in contrast to those just mentioned, is the derivation 
of error bounds. We emphasize these are true bounds and not just estimates. 
The MoA is a mature subject and there are several good books, see [6j [9j [10] 
for example as well as the Scholarpedia articles [ITJ [12]. We refer to the 
MoA approximation as a normal form. Generally, a normal form of a math- 
ematical object is a simplified form of the object obtained with the aid of, 
for example, scalings and transformations such that the essential features of 
the object are preserved. Here we not only preserve the essential features of 
the exact ODE's but bound the errors in the approximation with a bound 
proportional to the small parameter e. See [11] for the use of normal form 
in a similar context. 

This paper has a pedagogical aspect, giving the reader, who may not be 
familiar with modern long time perturbation theory, an introduction in a 
context where the proofs are easily understood. In addition, we hope that 
both newcomers to the field and mathematical scientists will find this a good 
introduction to the noncollective case of an FEL. We also hope that experts 
will find something of interest. The reader does not need to be familiar 
with averaging theory as we give complete proofs including detailed error 
bounds. Furthermore we obtain better results as our theorems are tuned to 
the problem at hand. In addition, to our knowledge, the treatment of the 
undulator problem in the mathematically rigorous and self-contained way 
that we do here has not been done before. Our mathematical analysis is 
not deep, using only undergraduate mathematics as commonly taught in 
advanced calculus courses, however it is complicated and somewhat intricate 
in spots. Finally, for us, it sets the stage for our more serious goal of a deep 
mathematical understanding of the collective high gain FEL theory. 

We proceed as follows. In §0 we start with the three-degree-of-freedom 
Lorentz equations with a general traveling wave field in f l2.7p - fl2.10p and then 
introduce z as the independent variable. The system has planar solutions 
where = y = p y and using a conservation law we arrive at a system of two 
ODE's fl2.33p . fl2.34p for the energy deviation and a precursor to a generaliza- 
tion of the so-called ponderomotive phase. By scalings and transformations 
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we discover the distinguished case of (jl.5p which then leads to a standard 
form for the method of averaging in fl2.62p . fl2.63p . The two dependent vari- 
ables are now a scaled energy deviation and a generalization of the so-called 
ponderomotive phase. 

In §3]we present our main results. We begin by introducing the monochro- 
matic traveling wave field, the case of main physical interest. The system 
is carefully defined in §3.11 In §3.21 we define nonresonant, A-nonresonant, 
resonant, and near-to- resonant v in the Mo A context. We emphasize that 
as v varies the system passes through resonant and nonresonant zones. The 
NR case, its first-order averaging normal form and associated solutions are 
presented in §3.31 along with a proposition giving an appropriate domain for 
the associated vector field. §3.31 sets the stage for the more interesting NtoR 
case of §3.41 The NtoR system is carefully defined along with a proposition 
giving an appropriate domain for the associated vector field. The first-order 
averaging normal form is derived and solutions written in terms of solutions 
of the simple pendulum system. It is unlikely that all v values are covered 
accurately by our normal forms, however we are able to argue in §3.4.41 that 
there is a sense in which the NR case emerges from the NtoR case. The 
third and fourth steps of the Mo A are performed in §3.51 and §3.61 In fact, 
the statements of our first-order averaging theorems, which give an order e 
bound on the error for long times, i.e., intervals of 0(l/e), are presented in 
§3. 51 and applied to the phase space variables in §3.61 By taking special initial 
conditions (AP x0 = 0) we recover the result of standard approaches which 
focus on the energy transfer equations alone and do not consider the phase 
space variables. Finally in §3.71 we use our results in a low gain calculation 
and compare the result with [2]. 

The proofs of the two averaging theorems are presented in §H and they 
are based on an idea of Besjes (see [131 [HI US]) which leads to proofs without 
using a near-identity transformation, as in usual treatments of, e.g., [6j |9j 
[TO] . The NR case is an example of quasiperiodic averaging with a rigorous 
treatment of a small divisor problem in what is surely the simplest setting. 
The NtoR case is an example of periodic averaging. A novelty of our approach 
is that we use a system of differential inequalities, rather than the usual 
Gronwall inequality, to obtain better error bounds. 

The appendices contain calculations needed in the main text. Appendix 
lAl provides properties of the Bessel expansion of the function jj which is 
introduced in Section 13.21 In Appendices iBfUl we study the next-to-leading 
order terms g±,g2 used in Theorem [TJ and in Appendices iDllEl we study the 
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next-to-leading order terms gi,g^ used in Theorem 123 Appendix [F] gives an 
outline of a rigorous approach to regular perturbation theory which could be 
made into a theorem at the level of our averaging theorems. It is applied in 
§3.4.41 Appendix iGl provides some formulas used in Section 13.71 In Appendix 
|H]we discuss S = E r /cB u in the high gain regime and obtain a crude upper 
bound estimate of it. Finally, in Appendix |T] we show that the solution of the 
system of differential inequalities that is used in the proof of both averaging 
theorems (as well as in Appendix [F]) is indeed a solution. 

2 General Planar Undulator model 

2.1 Lorentz force equations 

Using SI units, the Lorentz equations for motion of a relativistic electron in 
an electromagnetic field, (E,B), are 

r = v(p), (2.1) 
p = -e(E + v(p)xB), (2.2) 



with ' = d/dt and where 



v(p) = -P- , (2.3) 

m'-f 



is the velocity, 7 is the Lorentz factor defined by 

7 2 = 1 + p • p/mV , (2.4) 

and m and — e are the electron mass and charge respectively. We introduce 
Cartesian coordinates as follows: 

r = xe x + ye y + ze z , (2.5) 

p = p x e x + p y e y + p z e z , (2.6) 

where e x ,e y ,e z are the standard unit vectors. Using (I2.ip -( T2~11 the system 
in Cartesian coordinates is 

Px ■ Py ■ Pz /r> >t\ 

x = , y = — y - , z = , (2.7) 

7717 7717 7777 

p x = -e[E x + v y B z - v z B y ] , (2.8) 
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py = -e[E y + v z B x - v x B z ] , (2.9) 
p z = -e[E z + v x B y - v y B x ] . (2.10) 

We denote the undulator magnetic field by B u and the radiation field by 
(E n B r ) whence 

E = E r , B = B r + B u . (2.11) 

A simple planar undulator model magnetic field which satisfies the Maxwell 
equations, V ■ B n = and V x B u = 0, as in j3], is 

B u = -B u [cosh(k u y) sm(k u z)e y + smh(k u y) cos(k u z)e z ] , (2.12) 

where B u > 0. Since V x B„ = there is a scalar potential <fi such that 
B u = V0. To satisfy V • B M = 0, <fi must satisfy Laplace's equation. The 
field f)2.12p is easily constructed by separation of variables and requiring 
periodicity in z with period \ u and then taking the first eigen-mode (See, 
e.g., [ISJ p. 145]). The scalar field is = —(B u /k u ) smh(k u y) sm(k u z). 

The traveling wave radiation field we choose is also a Maxwell field and 
is given by 

1 E r 
E r = E r h(a)e x , B r = -(e z x E r ) = — h(a)e v , (2.13) 

c c 

where E r is a constant, h is a real valued function on K. and 

a(z,t) = k r (z-ct) , (2.14) 

and k r is the parameter mentioned in the Introduction. 

Our primary emphasis is on the standard monochromatic example where 

H(&) = sm(u&) , h(a) = H'(a) = cos(z/d) , (2.15) 

and v > 1/2 thus h(a(z, t)) = cos(uk r (z — ct)). Note that the prime ' always 
indicates a derivative. Thus from §3]onwards we will use f l2.15p . However it is 
easy to carry through the first part of the analysis with general H and we do 
want to make a comment on the more general case. In this monochromatic 
case k r will be defined by fll.2p . fll.3p and the v will allow for a variable wave 
number for the traveling wave; it will be shown that v = 1 gives the primary 
resonance with the concomitant pendulum normal form. The extension to a 
sum of monochromatic waves is trivial and won't be discussed. 
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Using (1X3)1 . (1232]) . (I2TT3]) one can write (T2~gD- ([2~TUj) as 

Pz P 

Px = -e[—B u cosh(k u y) sin(k u z) —B u smh(k u y) cos(k u z) 

+Er (l--EL.)h(&(z,t))] , (2.16) 
rrryc 

p y = -e—B u smh(k u y) cos(k u z) , (2.17) 

p z = -e[-—B u cosh(k u y)sin(k u z) + E r ^-h(a(z,t))] . (2.18) 
rri'-f Tw-fc 

It is easy to check that ( 12 . 7p . ( |2~T6|) - ( 12 . 1 8]) is a Hamiltonian system with 
Hamiltonian "H: 



"H = c^(P c + eA(r, t)) 2 + m 2 c 2 = mc 2 7 , (2.19) 

where the canonical momentum vector P c is related to p by p = P c + eA 
and the vector potential A is given by 

B E 
A(y, 2, t) = [-r^ cosh{k u y) cos(k u z) + -^-H{a{z, t))]e x . (2.20) 

Since A is independent of x the x-component, P CjX , of the canonical momen- 
tum vector P c is conserved, i.e., 

Px -eA x {y,z,t) , (2.21) 

is constant along solutions of (I2.7p . (l2.16p - (l2.18p as is easily confirmed directly. 
We will not make explicit use of the Hamiltonian structure in the following. 
The MoA does not rely on a Hamiltonian structure and this frees us from 
having to deal only with canonical transformations as we proceed to put 
(I2.7l) . (l2.16p -( I2.18| in an averaging standard form. 



2.2 Motion in y = plane with z as the independent 
variable 

It is common to take the distance z along the undulator as the indepen- 
dent variable, rather than the time t. In fact after unsuccessfully trying 
to stay with t we decided to follow the common procedure. With the usual 
abuse of notation, we write, from now on x(z), y(z),p x (z),p y (z),p z (z) instead 



12 



of x(t(z)),y(t(z)),p x (t(z)),p y (t(z)),p z (t(z)) whence the ODE's (1277)) 
(gJSD become 



dx = Px dy = Py dt L = m^ 
dz p z 1 dz p z dz p z ' 

-7^ = --[cB u cosh(k u y) sin(k u z) - —cB u smh(k u y) cos(k u z) 
dz c p z 

+E r ( T ^ - l)h(a(z,t))} , (2.23) 

Pz 

dpi = --^cB u smh(k u y) cos(k u z) , (2.24) 
dz cp z 



= — - \——cB u cosh( k u y) sm( k u z) + E r —h(a(z,t))} . (2.25) 
dz c p z p z 

The initial conditions at z = will be denoted by a subscript 0, e.g., t(0) = t . 
Clearly t is the arrival time of an electron at the entrance, z = 0, of the 
undulator. 

Here and in the rest of the paper we consider the initial value problem 
(IVP) with y = p y Q = 0. It follows, with no approximation, that y(z) = 
p y (z) = for all z and the six ODE's (|2T22j) - ( 12^51) reduce to four. The 
righthand sides (rhs's) of (12 . 2 2 [) - ( |2 . 2 5 f) are independent of x and so we do 
not need to consider the x equation until §3.61 It is standard, and also quite 
convenient, to replace p z by the energy variable 7. With 7(2;) defined in 
terms of p x (z) and p z {z) by (12 .4[) and using (I2.23|) and (I2.25j) . we obtain 
7' = {.PxP'x + PzP' z ) /m 2 c 2, j = — (eE r /mc 2 )(p x /p z )h(a(z,t)). Finally, we take 
a as a dependent variable in place of t and we define 

a{z) := a{z, t(z)) = k r (z - ct(z)) . (2.26) 

Later it will be seen that a is a precursor to a generalization of the so-called 
ponderomotive phase which emerges naturally as we put the ODE's in a 
standard form for averaging. 

With the above four changes the ODE's for t,p x ,p z in f l2^ .f l2T23|) .f l2T25|) 
become 

£ = Ml - ■ (2-27) 

dz p z 

= --[cB u sm(k u z) + E r (^ - l)h(a)} , (2.28) 
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where the initial conditions are a(0) = «o := —k r cto,p x (0) ='■ PxO,j(fy ='■ 7o- 
Here p z must be replaced by 



Pz 



Vm 2 c 2 ( 7 2 -I) -pi , (2.30) 



and it is easy to see that (I2.27p -f l2.29p are then self contained. From now on 
we restrict p z to be positive: 

p z > . (2.31) 

Note that, by (12.271) . a is a strictly decreasing function whence, as one ex- 
pects, z < c(t(z) — t ). It is also easy to check that 

cos(fc^) - J^-^H(a) , (2.32) 



mcK cB u k r 

is conserved along solutions of f l2.27p - fl2.29p . This conservation law is identi- 
cal to fl2T2Tj) with y = 0. Recall that K was defined by (TO) . 

In summary, the solution of the I VP for fl2.22l) - fl2.25l) with y = p y0 = 0, 
which entails y = p y = 0, is given in terms of the solution of fl2.27p . fl2.29p . 
i.e., of 

^ = *r(l-— ), «(0) = a , (2.33) 
dz p z 

dl eE r p x 

— = — h(a) , 7(0) = 70 , 2.34 

dz mc z p z 

with 

(E . k \ 

cos{k u z) - 1 + -^r-jrl H ( a ) - H ( a o)}J > ( 2 - 35 ) 

and p z in (12.301) . To complete the solution of fl2.22p -f f2T25|) it suffices to note 
that t(z) is determined from (I2.26P in terms of a(z) and x(z) is determined 
from (I2.22p by integration. 
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2.3 Standard form for Method of Averaging 

We begin by introducing the normalized energy deviation rj and its 0(1) 
counterpart x via 



as mentioned in the Introduction. Here 7 C is a characteristic value of 7, e.g., 
its mean and e is a characteristic spread of rj so that \ becomes the new 
0(1) dependent variable replacing 7 in fl2.33p . fl2.34p . We are interested in 
an asymptotic analysis for 7 C large and r\ small as in an X-Ray FEL. Here 
we determine a relation between e and 7 C which leads to a standard form for 
the MoA and which will contain the FEL pendulum system at first order in 
the case of (12715)1 . 

As a first step we introduce new variables, in addition to \i as follows. 
From the conservation law in (I2.32p we anticipate that the order of magnitude 
of p x will be mcK. In addition j3 z := p z /may will be near 1 and so p z ~ mc-f. 
Thus we define dimensionless momenta by 



7 = 7c(l +rj) = Tc(l + e X) , 



(2.36) 



p x = mcKP 2 



X ■ 



Pz = mcrfP z . 



(2.37) 



Of course, by fT2~3T]) . 



P z >0- 



(2.38) 



A natural scaling for z is 



z = C/k 



(2.39) 



so that the undulator period is 2ir in (. 
Abbreviating 



8aux(() ■= a((/k u ) , 



(2.40) 



and with (JO]) the system (127331) . (127341) becomes 




aux 



f 1 P 

K 2 —^ — - — — 

eY c 1 + exPz 



h(8, 



aux 



(2.41) 



(2.42) 
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where ' = d/d( and £ is defined in (jl.4p . The initial conditions are 6 aux (0, e) 
6*o := O!o,x(0, e) = Xo- Moreover P z must be replaced, due to (12.301) . by 



P Z = ^1-±{1 + K*P%) with 7 = 7c (l+e X ), (2.43) 
and P x must be replaced, due to (12.351) . by 

P x = cosC + AP x0 + -J—[ H {6 aux ) - H(9 Q )] , (2.44) 



where 



AP x0 :=P x0 -l, P x0 := P x (0) = ■ (2.45) 

mcK 



Since p z > we have < P z < 1. We note that most derivations of the FEL 
pendulum take AP x0 = 0, see El H |5] . 
To expand P z we need 



1 + K 2 P 2 = 1 + if 2 (cos C + AP Q 



x0) 



2K 2 £ K 2 £ 2 
f — -(cosC + AP x0 )(H (6 aux ) - H(9 )) + -—(H(6 aux ) - H(6 )) 



2 



(2.46) 

and it is convenient to define 

K 2 

q(Q := 1 + i^ 2 (cos C + AP x0 ) 2 = q + 2K 2 AP x0 cos C + — cos 2( , (2.47) 

1 



q:=l + -K 2 + K 2 (AP x0 ) 2 . (2.48) 

Clearly q is the average of q(() over £. Now P x is 0(1) so, by (12 .43 j) . 

1 1 1 + K 2 P 2 1, 
— = 1 H 2 h Of — ) 

ft + 2 7 2(i+e X )2 + Sr 

= l + |(|(l_ 2£x + 0(, 2 )) + 0(l) 

= 1 + #1-2 £X) + 0(1) + 0(J). (2.49) 
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Thus using (EBD and ( E3SD , eq.'s (12T4Tj) and (g32D become 
2 7 



C = ~ ^ + e*r?(C)x + 0(3) + 0(e 2 ) , (2.50) 



x' = -/^ 2 ^(cosC + AP xQ )h(9 aux ) + 0(l/ 7 2 ) + 0(l/e 7c 4 ) • (2.51) 

To transform (j2.50B . fl2.5ip into a standard form for the MoA we need to 
introduce dependent variables that are slowly varying. We anticipate that x 
will be slowly varying, i.e., -^j will be small. To remove the 0(1) in fl 2 . 5 j) 
we define 

0:=e aux + Q(C) , (2.52) 

where 

Q(C) :=C + T sinC + T 1 sin2C, (2.53) 

2K 2 AP x0 K 2 . 

T := -A Ti := — . 2.54 

Note that T and Ti depend only on K and AP x0 and that 

0/(0 = ^2 . (2.55) 
Thus the system (12.501) . (12. 511) becomes 

9' = eK r q(C) X + 0(l/ 7c 2 ) + 0(e 2 ) , (2.56) 
X = -K 2 ^-(cos( + AP x0 )h(6 - Q(C)) + 0(l/ 7 2 ) + 0(l/e 7c 4 ) • (2.57) 

The initial conditions are 9(0, e) = 9q, x(^i e ) = Xo- To obtain a system where 
9 and x interact with each other in first-order averaging we must balance the 
0(e) term in (I2.56P with the 0(8/ £ 7 2 ) in ( I2.57p . In this spirit we relate e 
and 7c by choosing 

e = — 2 , (2.58) 

and so we obtain (jl.5j) . It is this balance that will lead to the FEL pendulum 
equations in This is the distinguished case mentioned in the Introduction 
and the system (j2.56p . fl2.57p can be written 

9' = eK r q(Qx + 0(e 2 ) , (2.59) 



17 



X ' = -eK 2 (cos( + AP x0 )h(6 - Q(0) + 0(e 2 ) , (2.60) 

which are now in standard form. Up to this point K r has not been fixed but 
now it is convenient to take 

K r = 2/q , (2.61) 

which we do from now on. Using (I2.48p . (12.611) is identical to (11.31) . Fur- 
thermore in the monochromatic case of (12.151) and §3], we will see that, with 
(I2.6ip . the primary resonance appears at v — 1. 
With ( glEED the ODE's (123511 . (1230]) become 

9> = e^ X + 0(e 2 ), (2.62) 
X ' = -eK 2 (cos( + AP x0 )h(6 - Q(Q) + 0(e 2 ) . (2.63) 
We now relate 9 to the so-called ponderomotive phase. We have, from 

d22nD,d23nD, and fl233J, 

0(C, e) = ^(C - k u ct({/k u )) + [C + To sin C + T x sin 2(] . (2.64) 

Using fl2T3^|) and (glMD we obtain 

9{k u z, e) = k r {z — ct(z)) + k u z + T sin k u z + Ti sin(2/c u 2;) . (2.65) 
For AP x -o = the variable 9 is the so-called ponderomotive phase, i.e., 

9(k u z, e) = {k u + k r )z — k r ct(z) + Ti sin(2k u z) , (2.66) 
where, for AP x0 = 0, 

k r K 2 _ K r K 2 _ K 2 K 2 

1 ~ 8k^ c ~ ~ 8"" ~ ~Aq~ ~ 4 + 2K 2 ' ( ' ' 

Thus in our context the ponderomotive phase arises naturally in the process 
of finding the distinguished relation between e and 7 C and transforming to 
slowly varying coordinates. In standard treatments it is introduced heuristi- 
cally to maximize energy transfer. 

To make the 0{e 2 ) terms in fr2T62|) .f l2T63|) explicit we first rewrite f[2H|) . f[2T42]) 
in terms of e, K and £ as 

28 1 

6 'aux = -^(1 " p") , (2-68) 



X = -K 2 e-^—^h(e aux ) , (2.69) 



where 



F* = + e X )- 2 (l + K 2 P>), (2.70) 

2 — 

P x = cosC + AP x0 + £ -A[ H (6 aux ) - H(9 )] . (2.71) 

The initial conditions are ^^(O,^) = ^^(O,^) = Xo- Under fl2.52IUI2.6ip . 
the system becomes fl2.68p . fl2.69p becomes 

& = %(1 - h + ^ , (2.72) 
£ 2 q P z q 

X = -e&jl—^KO - Q(O) , (2.73) 

-L ' ^X M 

where 

2 — 

P x = cosC + AP a . + ^[#(0 - Q(C)) - ff(0„)] • (2-74) 

The 0(e 2 ) terms in fl232l .f TZ63l) can now be determined by comparison with 
f l2.72p .f l2T73l) . We will do this in the monochromatic case of §£2 
Remarks: 

(1) Note that, by (Op . flDj]) . 7 C = y/E/e, in particular 7 C > and, by 

7 = 7 c (l + ^x) = v^(i + x). (2-75) 
Since, by f !2.3ip . we have the restriction 7 > 1 we also have, by (12.75 p . 

l + ex>0. (2.76) 



Because, by (I2~T3"H]1 . P 2 > 0, Eq. fl2~7D|) gives ^^1 + Ji 2 P2 < |1 + ex I 
and (I2.76P gives 



1 1 



x >-- + —^YT1^p! . (2.77) 
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Note that ( I2.77P defines our maximal domain of points (0,X>C)> i n 
particular it entails (I2.38p . (l2.76p . We will in §3.11 further restrict this 



Of course always 7 > 1 and, in fact, in applications 7 C , 7 ^ 1. However 
for our purposes it is convenient to base our work on the maximal 
domain fl2T77]) . 

(2) The transformation to the slowly varying 9 in (I2.52p works nicely be- 
cause ( (equivalently z) is the independent variable. If we had stayed 
with t as the independent variable this step wouldn't work. 

(3) Equations f l2.62p .f l2.63p are in the standard form for the MoA. How- 
ever we did not prove that the 0(e 2 ) are actually bounded by an e- 
independent constant times e 2 . In the monochromatic case in £J3] we 
will show that the two 0(e 2 ) terms are truly bounded by Ce 2 on an 
appropriate domain for appropriate constants C. 

(4) For the results of this paper the normalized field strength £ cannot be 
too big (or e won't be small) and it cannot be too small or another 
distinguished case will come into play. Of course for a seeded FEL, £ 
will be set by the seeding field. In Appendix [H] we present two very 
crude bounds that have some relevance to the beginning stages of a High 
Gain FEL. Here we simply note that for £ = 1000, e is approximately 



In an early approach to this problem we built a normal form analysis 
assuming £ small, so that the radiation field was a small perturbation 
of the undulator motion. We thus considered £ as a small parameter 
in addition to 1/7 C . This led to another distinguished case, which also 
had a resonant structure but with a different pendulum type behavior. 
Later we realized that £ is not necessarily small for cases of interest 
and we were led to the current case of fll.5p . 

(5) As will become clear in §0 the normal form for ( I2.62p is 9' = s2\- 
The normal form of (I2.63P depends on h. In the monochromatic case 
h(9 — Q(Q) = cos(u[9 — Q(C)}) an d the nonresonant, resonant and near- 
to-resonant structure will appear as v varies. In particular the primary 
resonance will appear at v — 1. However it is curious that if 



domain. 



0.001. 




(2.78) 
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with h(£) smooth and localized near £ = ±1 the resonance effect is 
washed out in first-order averaging. We will explore this briefly in §5\ 
We are studying the consequence of this in the collective case. 



3 Special Planar Undulator Model and aver- 
aging theorems 

We have the planar undulator in a standard form for the MoA in (I2.62p .f p.63p 
where the 0(e 2 ) terms can be determined from fl2.72p .f j2.73p . We now special- 
ize to a monochromatic radiation traveling wave, write the system in Fourier 
form, discuss resonance as a normal form phenomenon, develop the NR and 
NtoR normal forms and state two theorems giving precise bounds on the 
normal form approximations. Thus from now on the radiation field in (I2.13P 
is monochromatic, i.e., h, H have the form ( I2.15P with v> 1/2. 



3.1 The basic ODE's for the monochromatic radiation 
field 

In this section we introduce the notation which will allow us to state and 
prove our three propositions and two theorems. With f l2.15p .f l2~70|) . (I2.74p 
we show the dependencies of P x and P z on (9, x, C> £ i v ) by the replacement 

P X = U X , P Z = U Z , (3.1) 

where 

2 — 

11,(0, C, e, v) := cosC + AP x0 + ^WW* " Q(0\) ~ sin(z/0„)] , (3.2) 



s 2 



U z {9, x, C, e, v) :— — — (1 + s X )- 2 (l + Km%{9, (, e, v) . (3.3) 



Note that, by f[2777|> . (j53 ]> . 



1 1 



X > -- + VI + Kmi(9 } C, e, v) . (3.4) 

From now on, we restrict e to a finite interval (0, £o]. We are of course 
interested in e small, i.e., 0<£<1, and so, without loss of generality, we 
take 

0<e<e , < £ < 1 ■ (3.5) 
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Using fl3.4p .( )3.5|) we define the open set V(e, u), for < e < e , v > 1/2, by 

V(e, u) := {(9, X , C) e R 3 : X > ~\ + ^VT+Mp^)} , (3.6) 

which is our maximal domain in extended phase space. Accordingly we define 
the domain of Tl x to be {(0,C> £ , u ) E : < e < e , u > 1/2} and the 
domain of U z to be {{9,x,C,e,v) E (V(e,iy) x R 2 ) : < e < e , , v > 1/2}. 
It is easy to check that on the domain of U z the argument of the square root 
in (13.31) is positive and, for (9, x, C) £ *D( e , v \ we have (I2.76P and 

O<n,(0, X ,C,e,i/)<l. (3-7) 

Moreover with (123511 the ODE's (127721 . (127731) become 

= ^ (1 - n,(9,X,C,e,v) } + ? ' (3 ' 8> 

^-^T^i^S-^-Q(C)l), (3.9) 

where g and Q are defined in (I2.47IUI2.53I) . Of course the initial conditions 

are 9(0, e) = 9 ,x{0, e) = Xo- 

As suggested by (I2T62D . (12331) we now write (l378|) .( l37^|) as 

9' = ef 1 (xX) + e 2 gi(9,xX;e,is), (3.10) 
X' = ef 2 {9, C; i/) + e 2 g 2 {9, X , C; e, ^) , (3.11) 

where /i, / 2 are given by 

/i(x,C):=^P^, (3-12) 
Q 

f 2 (9, C; i/) := -^ 2 (cosC + AP x0 ) cos(u[9 - Q(()}) , (3.13) 
so that gi , g 2 are given by 

x, C; e, ,) := |d - n<(g> ^ >£>|/) ) + f d " **) , (3.14) 



e 2 g 2 (9, x, C; e, ^ ) := ^ 2 cos(z^ - Q(C)]) [cos C + AP, 
1 + 11,(0, x,£e,i/) J ' 
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(3.15) 



The ODE's (1378]) . (15791 and their equivalent form, fl3TTU|) . flBTTT]) . will be the 
subject of Theorem [Tj i.e., the averaging theorem for the NR case (see also 
Definition Q] in §3.2|) . They will also be the basis for the NtoR case. 

We need an appropriate domain for the vector field in (I3.10p .f p.lip when 
it comes to averaging theorems. There are two types of singularities in 
f l3.10p .f l3~TT|) . The first involves the e dependence of gi,g2 as e — > 0+. On 
the surface it appears that the first term on the rhs of f)3.14p is 0(l/e 2 ), 
however it is 0(1). In fact, when combined with the second term the rhs 
is 0(e 2 ) so that g\ is 0(1). Similarly, g 2 appears to be 0(l/e), however 
again there is a cancellation so that g 2 = 0(1). This should not come as 
a surprise since the construction of the distinguished case (see the remarks 
before (I2.59P ) Proposition 1 makes this precise by finding the limits of g±,g 2 
as e 0+. Thus the e = singularity is removable. There are also singu- 
larities for Il 2 = 0,ex = — 1 which are not removable. This is reflected in 
the fact that even though fx, f 2 are nice, gi,g 2 have these singularities. How- 
ever these singularities are excluded from our maximal domain V(e, v) (see 
(1277511 . (157711 ) and so the vector field in ffHTTU]) . ffHTTT]) is of class O 00 on T>(e, v) 
for < e < Eq < l,u > 1/2. Nevertheless since T>(e, v) is dependent on e 
it is inconvenient to use it in an averaging theorem. Thus we now restrict 
T>(e, v) to an e-independent domain W{eq) x R. 

To motivate W we note that, by f|3.2p and since v > 1/2, 

\n x (0,(,£,v)\ <U X)Vb (e) , (3.16) 

where 



U XtUb (e):=l + \AP x0 \+2e 2 q. (3.17) 
Clearly, by fl37l6D . fl37i71) . 

<-^ + ^l + Kmi ub (e ), (3.18) 

whence, by (13. 6p . we can "shrink" the maximal domain V(e, v) to the e- 
independent domain W (eq) x K. where 

W(e) :=Rx (xi b (e), oo) , (3.19) 
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with 

Xw(s) ■= -\ + -j=y/l + K*ILl tVb (e) . (3.20) 

3.2 Resonant, nonresonant, A-nonresonant, near-to- 
resonant 

Now that the structure of the gi have been characterized at the level needed 
for the averaging theorems, we discuss the structure of the /, defined in 
fl3TT2|) . fl3TT3l . Clearly f x is 2tt periodic in (. We write, by fl233|) . fl3TT3|) . 



f 2 (6, C; v) = -isf 2 (cos C + AP x0 ) cos [v6 - v( - uT sin C - uT 1 sin 2( 

=: M9,CM;v) , (3-21) 
where f 2 (9, Ci, C 2 ; v) := -/^(cosCi + AP^) 

x cos I u8 — ( 2 — ^T sinCi — z/Yisin2(i ). Since f2(0, £1, C2; 27 ) is of class 



C 00 in (Ci, C2) and 27r-periodic in d and C2 we conclude from f )3.2ip that f 2 
is a quasiperiodic function of ( with two base frequencies 1 and v (for the 
definition of quasiperiodic functions, see, e.g., [9]). To make the resonant 
structure explicit we write f2 as 

/ 2 (0, C; 1/) = exp(zz/(# - (J )././((,: 1/, AP X . ) + cc , (3.22) 

where 

jj((;v,AP x0 ) := (cosC + AP E o)exp(-w[T smC + T 1 sin2C]) , (3.23) 
is 27r-periodic in (. The Fourier series of jj is 

is, AP x0 ) ~ ^ jj(n; u, AP x0 )e l < , (3.24) 



with 



fj(n; u, AP x0 ) := ±- [ d(jj((; v, AP x0 ) e"** , (3.25) 
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and Z being the set of integers. Since v, AP x o) is a 2-7r-periodic C°° 
function its Fourier series ( I3.24p is absolutely convergent, i.e., 
Snez \ jj( n 'i v i ^Pco)! < oo whence ~ in (13 . 24[) can replaced by =. The f 2 in 
Eq. ( 13. lip can now be written 

f 2 (9, C; v) = -^ 9 Y,Tj{n\ AF l0 )e j( "^ )c + cc , (3.26) 

which clearly shows the resonant structure in that the ( average of f 2 is zero 
for v ^ integer. In Appendix |X] we find 

jj(n; v, AP x0 ) = ~J(n, 1, v, T , T x ) + ^J(n, -1, u, T , Ti) 

+AP x0 J(n,0,u,T o ,T 1 ) , (3.27) 

where 

J(n, m, v, T , T x ) := ^ </ m -n-2zO T o)^0 T i) , (3.28) 

and Jk is the /c-th-order Bessel function of the first kind. Note that 

AP x0 ) = jj(C; f, AP x0 )* which implies jj(n; v, AP xQ ) is real. This 
is confirmed in the explicit form of ( I3.27p .( 13".28p since the are real valued. 
The time average of fi in (I3.12p is clearly 

U X ) := lim [I f A( X , QdQ = 2 X . (3.29) 



Since the series in ( I3.26P converges uniformly in ( and since exp(i(n — 
5n tlJ , the time average of the quasiperiodic f 2 is 

Ud-u) := lim [I T / 3 (fl,C;v)dC] 

if f £ N 



# 2 jj(A;; fc, AP x0 ) cos(^) if i/ = jfe e N , ^ 3 ' 30 ^ 

where N denotes the set of positive integers and where we have used the 
fact that jj is real. This forms the basis of our definitions of resonant, 
nonresonant and near-to-resonant frequencies v. 
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Definition 1. (Resonant, nonresonant, A-nonresonant, near-to-resonant) 
Let v > 1/2. We say v is nonresonant (NR) ifu^N and resonant otherwise. 
We also say that v is A-nonresonant (A-NR) when v £ [k + A, k + 1 — A] 
with A £ (0, 0.5) and k £ N. Note that v is NR if it is A-NR. We say that v 
is near-to-resonant (NtoR) if v = k + ea where k £ N, a £ [—1/2, 1/2]. Recall 
< e < Eq < 1 and that we take N to denote the set of positive integers. □ 

Remark: 

In our various estimates we need to keep v away from zero but want to in- 
clude v — 1 since it is the primary resonance. Thus we require v > 1/2 and 
since e < 1 we require \a\ < 1/2. 

It follows from the Fourier form of (13.261) that it is only possible to have a 
nontrivial normal form, i.e., fi ^ 0, if v is an integer. Thus v — 1 is the 
primary resonance as discussed in the Introduction, justifying the choice of 
K r in (11.31) and (I2.6ip . The resonant normal form at v = k is of the pendulum 
form with 

& = e2 x , x' = -eK 2 n(k; k, AP x0 ) cos(^) . (3.31) 
From Appendix 1X1 we have, for AP x0 = 0, 

? n(h . h - / U-^VnM - Jn+iM] if k = 2n+ 1 , , 
jj(fc,fc,U)-| if k even , [6 - 6Z) 

where x n := (2n + l)Ti and n = 0, 1, ... with Ti defined in (12.541) . Thus, 
for AP x0 = 0, (I3.3ip gives the standard FEL pendulum system (see also 

6' = e2 X , x' = -eK 2 fj(k; fc, 0) cos(^) . (3.33) 

For a general quasiperiodic function with base frequencies 1 and v it is pos- 
sible to have a nontrivial normal form for every rational v and thus v would 
be defined to be resonant if it were rational. 

Since f\ (x) is independent of v it plays no role in Definition [TJ Clearly 
f2{0', v) = if v is NR. We state our NR theorem in Theorem Q] for the A-NR 
case. In fact because of a small divisor problem the theorem will require v 
to stay away from neighborhoods of resonances in order to get an o(l) error 
bound as e — > 0+. We will obtain an 0(e 1_/3 ) bound for (3 £ (0, 1] depending 
on the distance from the resonance by letting A = 0(e^). In the resonant 
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case we will explore an 0(e) neighborhood of the resonance. This will allow 
us to at least partially fill the gap between the A-NR us in the NR theorem 
and the us in the NtoR theorem. The way this occurs will be seen in the 
error analysis in the proofs of Theorems [1] and [2j 



3.3 The nonresonant case and its normal form 

The exact ODE's in the NR case are (I3.10p . fl3.lll) . Clearly they are the same 
in the A-NR subcase. By definition, the NR normal form, i.e., the normal 
form with u NR, is obtained from fl3.1(JI) . fl3.11l) by dropping the 0(e 2 ) terms 
and averaging the rhs over ( holding 8, \ fixed whence, by f)3.29l) . fl3.30p . 

v[=£f 1 (v 2 )=e2v 2 , (3.34) 
i4 = e/ 2 (vi;i/) = 0, (3.35) 

with the same initial conditions as in the exact ODE's, i.e., vi(0,e) = 
6o,v 2 (0,e) = xo an d solution 

«i(C>e) = 2x o eC + 0o , v 2 ((,e)=xo- (3-36) 

The solutions of fl3.34p . fl3.35p with e = 1 play an important role in the 
statement and proof of Theorem [1] and we refer to 

v(.,l) = M.,l),v 2 (-,l)) , (3.37) 

as the guiding solution at (#o>Xo)- Note that the v in (13.371) should not be 
confused with the velocity vector v in (12.31) . 

Our basic result in the NR case will be that \0(() — vi((,e)\ and \x(C) ~ 
v 2 ((,e)\ are 0(e/A) in the A-NR subcase. If A = 0(1) then the error is 
0(e). Putting A into the order symbol allows one to discuss A small, e.g., 
as a function of e. The precise statement is given in §3.5.11 and its proof is 
given in §4.11 

Proposition 1. Let < e < eo < 1 and let u > 1/2. Then 

W(e ) xRc W(e) xlc V(e, u) . (3.38) 

Moreover gx(-; e, u),g 2 (-; e, u) are C°° functions on W(eo) x R. Furthermore, 
for (6,xX)£W(e )xR, 

\im[ 9l (6,xX;e 1 u)} = - q ^( 3 -^ + 12 X 2 ) 
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-^^sin^-QCOD-Bin^o^CcosC + AP^) , (3.39) 
lim [g 2 (9, X ,C,e, u)] = K 2 X cos(iy[9 - Q(C)])(cosC + AP x0 ) . (3.40) 

e— »0+ 

Remark: 

Proposition [1] entails that the vector field on the rhs of (I3.10p . fl3.lip is a C°° 
function on W(so) x K. (whence the vector field on the rhs of (13. 8p . (13. 91) is 
a C°° function on W(eq) x R, too). Proposition [1] will allow us to use, in 
Theorem [TJ the domain W(eq) x K. Furthermore the domain is large enough 
to contain the \ °f physical interest (see Proposition [3] in §3.5.3p . 

Proof of Proposition^ Let (9,x,() € W(e)xR. Then, by fl336|) .f l3~T9|) .f l3~20|) . 

whence, by (13.61) . (9, x, C) ^ T^{ £ i v ) which proves the second inclusion in 
f)3.38p . The first inclusion in (I3.38P follows from ( 13 . 1 9[) and from the fact 
that, by (I3.20p . Xib{z) is increasing with e. Moreover, by the remarks after 
(I3.15p . gi(-;e,b , ),g2(-]£,v) are C°° functions on V(e, v) whence, by (13.381) . 
they are C°° functions on W(eq) x IR. Finally, (I3.39p .f l3.40p are proven in 
Appendix El (see (JR8]) , f lBl3|) ) . □ 



3.4 The Near-to-Resonant case and its normal form 
3.4.1 The Near-to-Resonant system 

According to Definition [1] we have, in the NtoR case, 

u = k + ea, (3.41) 

where fceN and a G [—1/2, 1/2] is a measure of the distance of v from k. 
The 0(e) neighborhood of k is natural in first-order averaging. If \u — k\ is 
too small then the normal form will be close to the resonant normal form 
and if |z/ — k\ is too big, then v will be in the NR regime. Eq. (I3.4ip clearly 
includes the resonant case for a = 0. We start from fl3.10l) . fl3.1ip . fl3.13p use 
f)3.4ip and obtain 

0' = eA( X , + e 2 9l (9, X , C,e,k + ea) , (3.42) 
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X ' = ef 2 (9, (;k + ea) + e 2 g 2 {6, X , C,e,k + ea) , (3.43) 

with initial conditions 0(0, e) = 9 ,x(0,e) = Xo- 

By the remarks after (I3.15p . the vector field in fl3.42IUI3.43p is of class C°° 
on the maximal domain T>(e, k + ea). Since f± in (I3.42p is independent of e 
the normal form associated with it will be the same as in the NR case. We 
now need to study the e dependence of f 2 in (13.431) . From (13.221) . 

K 2 

f 2 (9, (;k + ea) = -— exp(i(k + ea){6 - k + ea, AP x0 ) + cc 

K 2 

= — — exp^[k9 - eaQ) exp(-ik()jj((] k, AP x0 ) 

x exp{iea[6 - T sin C - Ti sin 2(]) + cc , (3.44) 

where we have used from (I3.23P that 

k + ea, AP x0 ) = (cos C + AP x0 ) exp(-i(k + ea) [T sin ( + Ti sin 2(]) 
= jj(C;fc,AP E o)exp(-iea[T siiiC + Tisin2C]) . (3.45) 

For a = the resonant normal form of (I3.30p is obtained in ( I3.44p . For a ^ 
( I3.44p displays two e dependencies. The first is the ea( one which cannot be 
expanded since it is 0(1) for ( = 0(1 je) the upper range of our averaging 
theorem. The second is the ea factor in the final exponential which can be 
expanded and makes an 0(1) contribution to g 2 in ( I3.43P for all (. Therefore 
we rewrite f 2 as 

f 2 {6, (;k + ea) = f 2 R (9, e(, (; k, a) + 0(e) , (3.46) 

where 

K 2 

f 2 (9, t, C; k, a) := — — exp(i[k9 - ar}) exp(-ik()jj((; k, AP x0 ) + cc 

= ~ ex V (i[k9 - ar})J2fj(n; k, AP x0 )e^ n ^ + cc . (3.47) 

We can now write the basic system for the Mo A, in this NtoR case. From 
dS32D - (jS37D we obtain 

0' = ef?( X , C) + e 2 g?(d, X , C, e, k, a) , (3.48) 
X ' = ef^O, e(, (■ k, a) + e 2 g*(9, X , C, e, K «) > (3-49) 
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where 

/1 R (X,C):=A(X,C) = ^|^, (3.50) 

9i(0, X, C, e, fc, a) := X, Cl + ea ) , ( 3 - 51 ) 
X, C, £, k, a) := g 2 (0, x, C^^ + ea) 

+- £ [h{9, C; k + ea)- f*(6, eC, C; k, a)] , (3.52) 

and where gf can be rewritten as follows. By (I3.2ip we have 
/ 2 (0, C-k + ea) = -K 2 (cos C + AP x0 ) 

cos ^(fc + ea) [0 — C — T sin £ — Ti sin 2£]^ , 



(3.53) 



and, by (0,(1133), 



T 

x exp(— zA;[T sin ( + T 1 sin 2(}) + cc 



£ C C; k, a) = — — exp(i[k9 - eaQ) exp(-ifcC)(cos ( + AP x0 ; 



-K 2 (cosC + AP x o)cos^fc[e-C-T sinC-Tisin2C] - ea(J . 

(3.54) 

Using fl333l) .f l33D we can write ( 13321) as 

92(P, X, C, £ > k, a) = g 2 (9, x, (]S,k + ea) 

(cos C + AP x0 ) ( cos f(k + ea)[6-(- T sin ( - Ti sin 2(}^J 



K 2 

e 



- cos \k[6 - C - T sin C - T x sin 2(] - eaCJ J , (3.55) 

which will be useful in obtaining bounds for gf in Appendix [El 

The following proposition is the analogue of Proposition [1] for the NtoR 
case. 
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Proposition 2. Let < e < e < 1 and let a 6 [—1/2, 1/2], fc G N. 77ien 
g^(-;e,k,a),g2{-',s,k,a) are C°° functions on W(eq) x R. Furthermore for 

hm [g*(9,x,Ce,k,a)} = -^(|g(C) + 12* 2 ) 

-^^(^-QCOD-Binf^VcosC + AP^o) , (3.56) 



lim [<7*(0, X , C, e, fc, a)] = X K 2 cos(k[9 - Q(C)])(cos C + AP x0 ) 

£— >0+ 



+K 2 a(6 - T sin C - Ti sin 2Q 

xsin(fc[#-C-T sinC-T 1 sin2C])(cosC + AP x0 ) . (3.57) 



Remark: Proposition [2] entails that the vector field on the rhs of (I3.48p . (l3.49p 
is a C°° function on W(eq) x K. Proposition [2] will allow us to use, in Theo- 
rem EJ the domain W(eq) x M. 

Proof of Proposition^ The C°° property of gi(-; s, k, a), (•; £, k, a) follows 
from Proposition [1] and fl3.5ip . fl3.52p . Moreover ( I3.56j) . f l3~5T]) are proven in 
Appendix |D] (see (JEl , flDllj) ) . □ 



3.4.2 The NtoR normal form 

The NtoR normal form ODE's are obtained from (13.481) . (I3.49P by dropping 
the 0(e 2 ) terms and averaging the rhs over ( holding the slowly varying 
quantities 6,x,ea( fixed. We thus obtain from fl33Z) , fl33B) ,( 133S) , (USD} 
that 

v[ = ef?{v 2 ) = 2ev 2 , (3.58) 
v 2 = £ /if (^l) e C; k) = —eK (k) cos(kv 1 — ea() , (3.59) 

where 

K (k) := K 2 jj(k; k, AP x0 ) , (3.60) 

and the same initial conditions as in the exact ODE's, i.e., v i(0, e) = 0q, t>2(0, e) = 
Xo- For a = 0, eq.'s f l3.58p . fl3.59p become the resonant normal form f)3.3ip . 
For AP x0 = a = 0, eq.'s fl3.58p . fl3.59p are the standard FEL pendulum equa- 
tions, given by f l3.32p . fl3.33p . In the special case when K (k) = the ODE's 
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(j3.58|UI3.59p are the same as NR equations fl3.34p . fl3.35p and so this case 
needs no further comment. Note that the special case K (k) = occurs, 
e.g., when AP x q = and k even (see the remark after ( lA.lip ). 

The ultimate justification for the normal form (I3.58p .f p.59p comes from 
the averaging theorem itself. However, if we replace e( in (I3.49P by r and add 
the equation r 1 = e then this, together with f l3.48p . fl3.49p . is in a standard 
form for "periodic averaging" (=averaging over a periodic function) and the 
normal form f l3.58p .f l3.59p is obtained by averaging over ( holding x, r fixed. 
In this 6, x, t formulation standard periodic averaging theorems apply for the 
3D system of 9, x, t, see, e.g., [6j[T3] and Section 3.3 in [10]. We will however 
prove an averaging theorem directly tuned to fl3.48p . fl3.49p both to show the 
reader a proof in a simple context and in the process we obtain nearly optimal 
error bounds which are stronger than in those standard theorems. 



3.4.3 Structure of the NtoR normal form solutions 

Here we write the solution of the I VP for the normal form system f l3.58p .f l3.59p 
in terms of solutions of the simple pendulum system and discuss their behav- 
ior. Therefore in this Section we exclude the simple subcase where Kq = 0. 
Let v = (t>i,t>2), then it is easy to see that 

v(C > e) = v(eC,l)- (3.61) 
We first make the transformation v(r, 1) — > v(r) via 

♦M=Ulri)-(*t&r)- <M2) 

which gives 

= 2kv 2 - a , #i(0) = k6 , (3.63) 

GST 

^ = -K (k) cost)! , v 2 (0) = xo • (3.64) 

GST 

Thus we have scaled away the e and made the transformed system au- 
tonomous. Solution properties of fl3.63l) . fl3.64l) are easily understood in terms 
of its phase plane portrait (PPP). However it is more convenient to transform 
it to the simple pendulum system 

X' = Y t Y' = -smX, (3.65) 
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A(0; Z ) =: A , Y(0; Z ) =: Y , Z :=( y° ) . (3.66) 



o 

The required transformation is 

v 1 (t) = X(Qr; Z ) - sgn(if )| , (3.67) 
„ {lY({lT;Z ) + a 

w) = git ' ^ ^ 

where 

VI = n(k) := ^/2k\K^(k)\ . (3.69) 

From (13~6T1) .( 13~621) (l3~67l) and (l3~68j) . the solutions of ( l3~58l) . ( l3~59l are repre- 
sented by 

v 1 (C,e) = i , (3.70) 

nY(ne{;Z ) + a 

e) = gib ' ( 3 - 71 ) 

where 

7(9 v kn)-( Xo{6 °> k) \-( k9 ° + S ^ K ^ \ (i 72) 

Z o (0o,Xo,M-^y o(xO)A . a) )-[ {2kxo -a)/m J" ^ 

We now discuss the solution properties of f l3.58l) .( l3~59i) in terms of the 
simple pendulum PPP, [18], for (I3.65P using ( 13 . 70 j) and f)3.7ip . The equilibria 
of (I3.65P are at (X, Y) = (irl, 0) with integer I. 

The systems obtained by linearizing about these equilibria are centers 
for / even and saddle points for I odd. From the theory of Almost Linear 
Systems (see, e.g., [IS]), it follows that the equilibria are centers and saddle 
points for the nonlinear system. A conservation law for the simple pendulum 
system is easily derived by first noting that the direction field is given by 

g = -=^. (3,3) 

This equation is separable and has solutions given implicitly by |F 2 + 1 — 
cos AT = const. Thus 

£ Pen (X, Y) := l -Y 2 + U{X) , U(X) = 1 — cos A (3.74) 
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is a constant of the motion which is easily checked directly. Incidentally Sp en 
is also a Hamiltonian for the ODE's fl3.58p . fl3.59p but this plays no role here. 
The PPP is easily constructed from the so-called potential plane which is 
simply a plot of the potential U(X) vs. X, see [20J. The PPP shows that 
the solutions of the simple pendulum system has four types of behavior, the 
equilibria mentioned above, libration, rotation and separatrix motion. These 
can be characterized in terms of £p en . Clearly, £p en is nonnegative, the 
centers correspond to Sp en (X, Y) — and the saddle points and separatrices 
to £p en (X, Y) = 2. The motion is libration for < £p en (X, Y) < 2, rotation 
for £p en (X, Y) > 2 and separatrix motion for Sp en (X, Y) = 2 with F ^ 0. 
In the libration case the solutions are periodic, which is easy to show, and 
the period as a function of amplitude, [2T] , is given by 

f A dt 

where T(A) is the period associated with the initial conditions Xo = A, Yq = 
0. It is easy to show that lim^o T(A) = 2ir. 

We denote by B n the n-th pendulum bucket which is defined by 

B n := {(X, Y) e M 2 : £ Pe n(X, Y) < 2, \X - 2im\ < tt} , (3.76) 

with neZ. Note that, by (jg772j) . (ET73j) . 



£pen(Z (9 , xo, k, a)) = S R (9 , xo, k, a) 



1 r 2/cxo _ o l2 



2 L n(k) 1 

1 + sgn(K ) sin(^ ) . (3.77) 



Note also that, by (|570|) . (|577I]l .(Er72 ) 

\vi((, £ ) ~ e o\ 



X(ne(;Z )-X + ea( < \X(ne£; Z ) - X \ + e\a\( 



k ~ k 

(3.78) 

K' 2 (C, e) - Xol = 7^\ Y (Ve{; Z ) - Y \ , (3.79) 

l4,)|<Sfffl. (3.80) 

We can now discuss the four cases of equilibria, libration, rotation and 
separatrix motion. In each case, using f l3.78p .f l3~79l . (I3.80p . we will find 
d™ in , d^ in , Xoo > such that, for all ( > 0, 

\vi{(,e)-9 \ < df n (9 ,Xo,e(,k,a) , |^ 2 (C, e) - Xo| < df n {9 Q , X o, K a) , 
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(3.81) 

\v 2 ((,e)\ <Xoo(0o,Xo,M , (3-82) 

and we will at the same time observe that d™ m (6o,Xo,T,k,a) is increasing 
w.r.t. r. 

(I) Equilibria regime: Y — and either £p eri (X , Yo) = or 2. 
Clearly X = nl where I G Z and, by (13.721) . 



(2k Xo -a)/n(k) J UVU ' AU ' ' y \Y J \0 
so that # = (vr/-sgn(Ko(A;))f)Aandxo = a/2k. Thus, bv f l3T7U|) . (IBTTT]) . 

v 1 ((,e) = 9 + £ -^, (3.84) 
v 2 (£,e) = Xo. (3-85) 

Clearly, by direct substitution, these are solutions of f l3.58p . fl3.59p . In- 
cidentally these solutions are stable for I even and unstable for I odd. 

Clearly, due to (13781]) . (13782]) . f l378^|) . fl3T85|) . we can choose 

df n (0 o , Xo, e(, k, a) := ^ , d™^, Xo, k, a) := , (3.86) 
Xoo(0o,Xo,M) := |Xo| • (3.87) 

(II) Libration regime: < £p en (X ,Yo) < 2. 

In this case Z (9 , Xo, k, a) G B n (e ,k) where the integer n = n(9 ,k) 
is determined by the condition \Xa(8o,k) — 2irn(8o, k)\ < it. From 
fl3T70|) . fl3T7Tp we see that 

v(C, e) = v per (C, e) + viin{e() , (3.88) 

and it is easy to show that the periodic part has amplitude determined 
by the max and min values of X and Y and the linear growth term is 

WeC) = ( £< J h ) • (3-89) 
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The maximum values X max and Y max of X and Y satisfy, by ( I3.74p . 

£ Pen (Z ) = iy 2 + 1 - COSX = ^Y^ax = 1 ~ COS X max , (3.90) 



whence 



X max {9 , Xo, k, a) = 27in(6 , k) + arccos(cosX - ^F 2 ) 



= 2irn(9 0l k) + arccos^l — £r(9q, xo, k, a) 
Y max (6 ,Xo,k,a) := ^/2£ Pen (Z (9 , xo, k,a)) 



= y/2S R (9 ,Xo,k,a) , 

(3.91) 

and the minimum values X min and Y min of X and Y are given by 

X m in • 4:7TTl X max , Y rn i n . Y max . (3.92) 
Here arccos is the principle branch of the inverse cos mapping [—1,1] — > 

[0,4 

We now determine d? in , d% in and Xoc- It follows from f l3T7H|) . fl3T79|) . fl3T80|) . 
<^M,^M,<^M,^M that 

\X(ne(-Z )-X \+e\a\( 



\v 1 ((,e)-9 \ < 
< 



k 

2X max (9 Q , Xo, k, a) - 4:7in(9 , k) + s\a\( 



k 

2 arccos ^1 — £r(9 , Xo, k, a)^j + e\a\( 



dr n (9 Q ,Xo,e(,k,a) 



k 

(3.93) 

MC, e) - Xol = ^\ Y (^ £ C, Z ) -Y \< ^Y rnax (9 , X o, K a) 
Q(k) 



k 



y/2£ R (0 , X o,k,a) =: d™"(0 o , Xo, k, a) , (3.94) 



{l\Y{{le(;Z )\ + \a\ QY max (9o,Xo,k,a) + \a\ 

\V2{(,S)\ < T7 < 



2 k 2 k 

o, Xo, k, a) + |a 



2k 



■ Xoo(6'o,Xo, fe,a) • (3.95) 
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(Ill) Separatrix regime: Y ^ and £p en (X ,Y ) = 2. 



In this case (X, Y) e B n (g 0j k) where the integer n = n(8o,k) is deter- 
mined such that \Xq(9q, k) — 2nn(6o, k)\ < it. Clearly 



\X - X \ < 2tt , \Y-Y \< y/2£ Pen (X , Y ) = 2 , \Y\<2. 

(3.96) 

For Y Q > 0, (X(t), Y(t)) -»■ ((2n + 1)tt,0) as t -> oo and, for Y" < 0, 
(X(t), Y(t)) -> ((2n - 1)tt, 0) as t -> cx). Thus for large ( 

„ (£f ) „ 1 ( < 2 " ± 1)* " + £ « ) , (3.97) 

which is the odd / solution in case I. 

We now determine d± in ,d™ in and Xoo- By (13~78|) . (13~79|) . (13T80|) . (ETgT]) . 
(EH2D,(E96D 

|ui(C,e) - So I < 7 

< 27r+ / |aK =: df n (0o,Xo,£C,A:,a) , (3.98) 

MC, e) - Xo\ = ^\Y(fleC\ Zo) -Yo\<^- 

=:d™ n (6 , Xo ,k,a), (3.99) 
{l\Y{Qe(;Z )\ + \a\ 



KC,e)|< 



2A; 

2f2(As) + la] , . , 

< 2k =: Xoo(^o,Xo,fc,a) . (3.100) 



(IV) Rotation regime: £p en (Xo, Yg) > 2. 

For Yq > 0, X is increasing and F is periodic such that 



V2y/£ Pm (X ,Y ) -2<Y< V2^S Pen (X ,Y ) , (3.101) 
and for Y < 0, X is decreasing and Y is periodic such that 



■V2y/£ Pm (X ,Y ) <Y< -V2y/£ Pm (X ,Y )-2 .(3.102) 
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Clearly f 2 (-,e) is periodic. We now determine c?™ n ,c?™ n and Xoo- It 
follows from (I3.101IUI3.102I) that for any choice of Y 



\Y-Y \< V2y/£ R (6 , X o,k,a) - V2y/£ R (6 , X o, k, a) - 2 , (3.103) 
\Y\ < ^2£ R (6^ X o,k,a) . (3.104) 

It follows from ( 13779]) . f lBTHUj) . f[3TTU3j) . f l37TMj) that 

\v 2 {(,e)- X o\ = ^\Y(Qe(;Z )-Y \ 

< ^ (V2y/£ R (6 ,xo,k,a) - V2^£ R (9 , X o,k,a)-2 

=: d™ m (6 , xo, k, a) , (3.105) 

n|y(neC;^o)l + H ^ fl(fc)V2g fl (go,xo,fc,a) + |a| 
MCOI < TTi ^ 



2k ~ 2k 

=■ Xoo(0 ,xo, k,a) . (3.106) 

It follows from (l3T65|UI3.1()4p that 



\X(QeC;Z ) - X \ = | / X'(s)ds\ = | / Y(s)ds\ 

Jo Jo 

< / \Y(s)\ds < s/2 / y/£ Pm (X(s),Y(s))ds 
Jo Jo 



= V2tte( v /£ Pen (X ,Y ) = y/2neCy/£ R {e , Xo, k, a) ,(3.107) 
whence, by (I3.78p . 

|ui(C,e) - m < t 

y/2n(k)e{y/£ R (6o, Xo, k, a) + e\a\{ 
k 

=:d? in (e , X o,e(,k,a). (3.108) 



Clearly the simple pendulum system is central to our NtoR normal form 
approximation. Every student who has taken a course in ODE's or Classical 
Mechanics has studied the pendulum equation at some level. However, not 
every reader of this paper may know the general settings of the equation. 
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So, as an aside, we thought some might be interested in knowing how it 
fits in a broader context. First, the pendulum equation is a special case of 
the nonlinear oscillator x + g(x) = and second, the nonlinear oscillator 
is an important subclass of the class of second-order autonomous systems 
x = f(x,y),y = g(x,y). The nonlinear oscillator is discussed in many texts, 
and here we mention [TH] and [22]. Its PPP is easily constructed from the 
potential plane as mentioned above and in [20J. After the class of linear 
systems, the class of second-order autonomous systems has the most well 
developed theory [23]. Here the qualitative behavior is completely captured 
in the PPP's. What's missing from a PPP is the time it takes to go from 
one point on an orbit to another, but this is easily determined using a good 
ODE solver. The limiting behavior of all solutions bounded in forward time 
is given by the celebrated Poincare-Bendixson theorem and as a consequence 
existence of periodic solutions can be inferred and the possibility of chaotic 
behavior is eliminated. It also follows that a closed orbit in the phase plane 
corresponds to a periodic solution. 



3.4.4 NR limit far away from the pendulum buckets 

Even though for small e there will be gaps in v between the A-NR and NtoR 
cases, as we will discuss in the context of Theorems ITf2] we show here that 
far away from the pendulum buckets the NR normal form emerges. While 
not a rigorous argument since we do not quantify "large" it is a consistency 
check. As in Section 13.4.31 we exclude the simple subcase where K = 0. 

For Z far away from the pendulum buckets in the sense that \Y \ = 
\2kxo—a\/Q 3> 2, we are in the rotation regime. Letting X(s) = X(s),Y(s) = 
Y Y(s), s = Y S, (I3T65D . (13361) become 

dX dY 

—r~ = Y , — = -esinX, X(0) = X , Y(0) = Y = 1 , (3.109) 
ds ds 

where e = 1/Yq. A regular perturbation expansion yields X(s) = s + X + 
0(e), Y(s) = 1 + 0(e) as we show in Appendix |F] therefore X(s) = Y s + 
X + O(l/Y 2 ),Y(§) = Y + O(l/Y ) and thus from fl3~^ . fl3~7T|) . (l3~72|) 

u _ Y Qe( + Xp + O(l/Y 2 ) - sgn(^ (fc))7r/2 + ea( 

= 9 + ™±^ eC + (1/Y Q 2 ) = 2 Xo e( + 9 + O(l/Y 2 ) , (3.110) 
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«a(C, e) = + o(i/y ) = Xo + o(i/y ) , (3.1H) 

consistent with (I3.36p . 



3.5 Averaging theorems 

Recall that we have gone from our basic Lorentz system, fl2.22p - fl2.25p . to 
f l3.10p .f l3.lip with no approximations. We have also derived two related nor- 
mal forms for v > 1/2 in the NR ( §3.3p and NtoR ( §3.4p cases. Here we state 
theorems which conclude that the solutions of these normal form systems 
yield good approximations to the solutions of f l2.22p -( l2~25l) in the appropri- 
ate v domains. 

Our NR theorem in §3.5.11 will cover the A-NR case, i.e., closed subin- 
tervals [k + A, k + 1 - A] of (k, k + 1), where k = 0, 1, < A < 0.5, and 
we will obtain error bounds of 0(e/A) (Here A can be small as mentioned 
in §3.21 and §3.3p . Our NtoR theorem in §3.5.21 will cover the case where 
v = k + ea which includes the resonant v = k case and we will obtain error 
bounds of 0(e). 

3.5.1 A-nonresonant case: v £ [k + A, k + 1 — A] (Quasiperiodic 
Averaging) 

The exact ODE's to be analyzed are (13. 101) . (13.111) with the initial condi- 
tions 0(0, e) = 9o,x(0,e) = Xo an d where /i,/2 are defined by (l3.12p .f l3~T3l) 
and where jj(n;v,AP x0 ) is defined by (I3.25P and gi,g2 by (I3.14p . fl3.15p . 
The normal form ODE's are (I3.34p .f l3.35p with initial conditions v\(0,e) = 
9o,v 2 (0,e) = Xo an d solution (I3.36p . Note that Vi((,e) = Vi(e(, 1). 

We are now ready to state the NR theorem which roughly concludes that 
\9(C,e) - 2 Xo e( - 9 Q \ = 0(e/A) and | X (C,e) - Xo| = 0(e/A) for < C < 
0(1 /e) with e sufficiently small. To make the statement of the theorem 
concise, we now set up the theorem in nine steps. 

(1) (Basic parameters) 

Let < e < e < 1, fix < A < 0.5 and let v £ [k + A, k + 1 - A) 
where k is a nonnegative integer. 

(2) (Initial data) 

Choose O , xo such that (0 o ,Xo) £ x [~Xm,Xm]) where xm > 
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is chosen such that — xm > X/ft( £ o) where xib is defined by (13. 20 p . 
Clearly (R x [~Xm,Xm}) C W(e ) where W(e ) is defined by (1X19]) . 
Note also that, by (13. 36ft . the corresponding guiding solution v(£, 1) = 
(2xoC + #0, Xo) belongs to (R x [~xm, Xm]) for all ( G [0, oo). 

(3) (Guiding solution) 

Choose T > and define the compact (=closed and bounded) subset 

S := {v(r,l) : r G [0,T]} = {(2 Xo r + 0„, Xo) : r G [0,T]} (3.112) 
of (R x [~Xm,Xm]) C W(e ). Recall that v(C,e) = v(eC, 1). 

(4) (Rectangle around initial value (9q, Xo) : the basic domain for averaging 
theorem) 

Let W(9o, Xo, di, d 2 ) be the following open rectangle around S where 

W(6 ,Xoid 1 ,d 2 ) := (9 -d 1 ,9 + di) x ( Xo - d 2 , Xo + d 2 ) , (3.113) 
where 

2| Xo |T <d x , 0<d 2 < X o- Xib{eo) ■ (3.114) 

Note that the closure, W(9 , Xo, d\, d 2 ) = [9 — d\,9 + di] x [xo — 
d 2 , Xo + d 2 ], of W(9 , Xo, d\, d 2 ) is compact and that, by (j3.19p . (l3.112p . 

dSHSD.dSIIl, (^0, Xo) e S C W(9 , X o,dx,d 2 ) C W(0o,Xo,di,d 2 ) C 
W(sq). Thus, by Proposition [T] in §3.31 the vector field of the ODE's 
fl3TT(l.fl3TTll is C°° on W(9 ,xo,d 1 ,d 2 ) x R. 

(5) (Restriction on e ) 

Choose Eo so small that Xib( £ o) < ~Xm — d 2 . Note that this is made 
possible since, by ( I3.20p . 

Xib(eo) < ~ + ^=yJl + K*I% tVb (l) , 

whence Xib^o) < ~Xm ~ d 2 if 

£ <Cj=y/l + Km% ub (l) + xm + d 2 ^j . (3.115) 

Since the RHS of ( 13. 115ft is positive eo can indeed be chosen sufficiently 
small. 
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(6) (Exact solution in rectangle) 

Since the vector fields in (I3.10IUI3.11I) are C°°, solutions in W(9 , Xo, di, d 2 ) 
with initial condition 0(0, e) = 6> , x(0, e) = Xo exist uniquely in W(9 , Xo, d\. 
on a maximum forward interval of existence [0, /3(e)). Here di,d 2 sat- 
isfy f)3.114p . Either /3(e) = oo or the solution approaches the boundary 
of W as C — ► /3(e)— . See Chapter 1 of [21] for a discussion of exis- 
tence, uniqueness and continuation to a maximum forward interval of 
existence. 

For convenience we define I(e,T) := [0, T/e] fl [0, /3(e)). 

(7) (Lipschitz constants for f\, f 2 on rectangle) 
Let Li, L 2 be defined by 

2 2 if 2 X 2 

Li := -max (m27r] \q(()\ = 2[1 + — |AP x0 | + — ] , (3.116) 
q q 2q 

L 2 :=vK 2 (l + \AP x0 \) . (3.117) 

It follows by (J3I2D,03I3]), fl3TTT6|) . f l3TT7j) and for X , 2 , Xi, X2, C e », 
that 

|A(X2, - /lOti, 01 < ^^1X2 - Xi| < Li\X2 - Xi\ , (3.H8) 

q 

\M02,(;v)-f2(ei,(;v)\ 

= K 2 \ cosC + AP x0 | I cos(z/[£ 2 - Q(C)]) - cos(i/[0i - Q(C)])| 

< if 2 (l + \AP x0 \) \v[6 2 - Q(0\ - v[9i - Q(C)}\ 

= vK\\ + |AP X . |) \9 2 -9 X \ = L 2 \9 2 -9 X \, (3.119) 

where we have also used the fact that | cosx — cosy| < \x — y\. Thus 
Li, L 2 are Lipschitz constants for f ly f 2 on W(9 , Xo> di, d 2 ) respectively 
(in fact even on R 2 ). 

(8) (Bounds for g±,g 2 on rectangle) 

Appendix [C] gives a very detailed derivation of quite explicit minimal 
bounds for g x and g 2 . There we show, for (9, x, C) i n W(9 , Xo, d±, d 2 ) x 



\gi(9,X,(,£,v)\ < Ci{xo,SQ,v,d 2 ) , (3.120) 

where i — 1, 2 and d\, d 2 satisfy (I3.114p and where the finite C\ and C 2 
are defined by f lU^7|) .f lU30|) . 
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(9) (Besjes terms) 

Let Bi, B 2 be defined by 



B i(0 : = I / fi(v 2 (s,e),s)ds\ = | f fi(xo, s) ds\ , 
Jo Jo 

B 2 {C)-=\f h(vi(s,e),s;v)ds\ = \ [ h{2 X oes + 9 , s; v)ds\ 
Jo Jo 



(3.121) 

where 

/1O2, s) := fi(v 2 , s) - Uv 2 ) = 2(# - l)v 2 , 

Q 

f 2 {v x , s; v) := f 2 (v x , s; v) - f 2 {v X ] v) = f 2 {v u s; v) . 

(3.122) 

In ( 13.1211) we have used ( 13.361) . We will also need -Bi 00,-82 00 defined 
by 

A,oo(C) := sup Bi{ 8 ) , (3.123) 
•e[o,0 

for i = 1,2. 

We refer to Bi, I?2 as "Besjes terms" and their importance will be seen 
both in the bounds presented in Theorem [1] and in the proof of the the- 
orem where they eliminate the need for a near identity transformation 
(for the latter, see jSJ El Ed EH E2] ) . 

With this setup we can now state the NR approximation theorem. 

Theorem 1. (Averaging theorem in A-NR case: v £ [A; + A, A; + 1 — A], 
k = 0,1,...,0 < A < 0.5; 

With the setup given by items 1-9 of the above preamble we obtain, for Q £ 
I(e,T), that 

\9((,e)-2 Xo e(-9 \=0(e/A) , \ x ((,e) - X o\ = 0{eJA) .(3.124) 
More precisely 

|0(C,e) - 2 Xo e( - 9 \ <e([B 1)0O (T/e) + C.T] cosh(T 'y/IJ^) 
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+ [B 2j00 (T/e) + C 2 T}^smh(T^Lj7 2 )^ , (3.125) 

\x((,e) ~ Xo\ < e([B lt00 {T/e) + C^^^T^L^) 

+[B 2>00 (T/e) + C 2 T] cosh(T ^/LjT 2 )j . (3.126) 

Moreover 

B 1>00 {T/e) < Bt , B 2>00 {T/e) < B 2 (T, A) , (3.127) 

where i — 1,2 and the Bi,B 2 (T, A) G [0, oo) are finite, e -independent and 
are defined in terms of our basic parameters and initial conditions by 

£i := ^^(2\AP x0 \ + \) , (3.128) 
q 4 

B 2 (T, A) := ^J3 21 (T) + £? 22 (T) , (3.129) 



B 21 (T) := 2iv 2 [l + + l)\ X o\T][\jj(k;v, AP x0 )\ + \jj(k + l;u,AP x0 ) 

(3.130) 

B 22 (T) :=2K 2 (l + (k + l)\ X o\T\ ^ |5'(n; x/, AP x0 )\ . (3.131) 

^ ' n&(Z\{k,k+l}) 

Furthermore, for e q sufficiently small, (9((, e), x{Cj £ )) stays away from the 
boundary of the rectangle W(9o,xo,di,d 2 ) for ( G I(s,T). Thus the ODE 
continuation theorem (see [Eft Section 1.2]) gives (3(e) > T/e, hence I(e, T) = 
%T/e\. 

The proof of Theorem [1] is presented in §4.11 Note that the symbol 
0{e/A) conveys that the error contains the factor ^. 

3.5.2 NtoR case: u = k + ea (Periodic Averaging) 

The NtoR case was defined in §3.21 The exact ODE's to be analyzed in this 
case were derived in §3.41 and are given by ( I3.48p .( l3.49p with initial conditions 
^(O,^) = # ?x(0,£) = Xo an d where gi,g 2 are defined by (I3.5ip .( l3~52|) and 
fi, f 2 by (I3^7|) . (I33D1 . The normal form ODE's are (13381 . ( 13391 with initial 



44 



conditions vi(0,e) = 9 Q ,v 2 (0,e) = Xo solved by ( I3.70p . fl3.7ip . where X,Y 
satisfy the standard pendulum equations ( I3.65P with the initial conditions 

0322]). 

The setup for the theorem is as follows. 

(1) (Basic parameters) 

Let < e < So < 1, a G [—1/2, 1/2] and A; be a positive integer. 

(2) (Initial data) 

Choose # ,Xo such that (# ,xo) e(R x [— Xm,Xm\) where xm > is 
chosen such that ~xm > Xib(^o)- Clearly (E x [-Xm,Xm]) C W(e ). 

(3) (Guiding solution) 

Choose T > and define the compact subset Sr '■= {v(t, 1) : r G 
[0,T]} of W(e ) where v = (yi,v 2 ) with v x ,v 2 given by (157701) . (15771]) . 
Note that C W(e ) holds for arbitrary T > if 

Xib(eo) < Xo - ^ m (#o, Xo, k, a) (3.132) 
since |u2(t, 1) — Xo| < d 2 nm (9 , xo, a) where c?^ m is defined in §3.4.31 

(4) (Rectangle around initial value (6*0, Xo) : the basic domain for averaging 
theorem) 

Define an open rectangle Wr(9 , Xo, d±, d 2 ) around Sr by 

W R (8 , xo, d h d 2 ) := (9 - d u 9 + d\) x (xo - d 2 , Xo + d 2 ) , (3.133) 

where di,d 2 satisfy 

0<dT m (9 0l Xo,T,k 1 a)<d 1 , (3.134) 
< dr n (9 , xo, k, a)<d 2<X o- Xib(s ) , (3.135) 

with d™ n ,d™ n defined in 3574751 Note that (157135]) entails ( 157132]) . 
Note also that, by (15751]) . f[57173~i]) . (15155)1 . 

|«i(r, 1) - 0„| < < m (^o, Xo, r, fc, a) < df n (^o, Xo, T, fc, a) < d x , 
\v 2 (r, 1) - Xo| < d 2 nm (9 , xo, fc, a) <d 2 , 

(3.136) 

where we also used that <i™ m (#o, Xo, r , k, a) is increasing w.r.t. r. It 
follows from ( I3.133p . (l3.136p that (9 , Xo) £ Sr C Wr(9 , Xo, d±, d 2 ) and, 
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by (l3~T9l . ([57i52]) that W R (9 , Xo , d x , d 2 ) C W(e ). Thus, by Proposition 
M in §E31 the vector field of the ODE's (15^481 . (15^91) is of class C °° 

on Wr(0o, Xo, di, d 2 ) x R. Note that the closure, Wr^o, Xo? ^i, d 2 ) = 
[9 - di, 9 + dx] x [xo - d 2 , Xo + d 2 ], of W R (9 , xo, d x ,d 2 ) is compact. 

(5) (Restriction on e ) 

Choose e so small that Xib(^o) < ~Xm — d 2 . Recall from item 5 of the 
preamble to Theorem [1] that such a choice is always possible. 

(6) (Exact solution in rectangle) 

Since the vector fields in ( I3.48[) . (I3.49j) are C°°, solutions in W(9 , Xo, d x , d 2 ) 
with initial condition 9(0, e) = 9 ,x(0,e) = Xo exist uniquely on a 
maximum forward interval of existence [0,(3(e)). Here di,d 2 satisfy 
( I3.134l) .( f3.135p . Either (3(e) = oo or the solution approaches the bound- 
ary of W as ( — > (3(e)— . See Chapter 1 of [24] for a discussion of exis- 
tence, uniqueness and continuation to a maximum forward interval of 
existence. 

It is convenient to introduce I(e,T) := [0,T/e] fl [0,(3(e)). 

(7) (Lipschitz constants for f^, f 2 on rectangle) 
Let Lf , L2 be defined by 

2K 2 K 2 
Lf := L x = 2[1 + —I AP x0 \ + — ] , (3.137) 
q 2q 

L*:=K 2 k(l + \AP x0 \), (3.138) 

where we have also used (13.1161) and where d\ , d 2 satisfy (I3.134p , (I3.135P . 
It follows by <^m>,I^M>, (l5TTg]) . (l37[3TD . (133581 and, for X , 9 2 , X i, X2, C e 



I A R (X2 , C) - f?(xi , 1 = I h (X2 , C) - h (xi , 1 

<L 1 \x2-Xi\=L[ H \x2-Xi\, (3.139) 

|/MX,C;M)-/*(0iX,C;M)l 

= K 2 \ cosC + AP x0 \ cosl k[9 2 — ( — T sin£ — Ti sin2£] — ea( 



— cos yk[9i — ( — T sin ( — T x sin 2Q — ea( 
< kK 2 (l + \AP x0 \)\9 2 - 0i I = Lf\6 2 - 9 X \ , (3.140) 
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where we have also used the fact that | cosx — cosy| < \x — y\. Thus 
Lf , L% are Lipschitz constants for on Wr(9q, Xo ? di, d 2 ) (in fact 



even on M 2 ) 



(8) (Bounds for on rectangle) 

Appendix [E] gives a very detailed derivation of quite explicit minimal 
bounds for gf and g%- There we show that, for (6, x, C) £ Wr(8q, Xo? ^i, ^2) x 
E, 

(0, x, C, e, fc, a) I < Cf (xo, £0, 4) , 
10^(0, X, C, e, fc, a) I < (6>o, Xo, £0, a, di, d 2 ) , 

(3.141) 

where i — 1, 2 and rfi, g?2 satisfy (I3.134p .f l3.135p and where the finite C-f 
and Cf are defined by (lE ^ . dETl]) . 

(9) (Besjes terms) 

Let Bf- , be defined by 



where 



^f(C) H / /?(t*( a) e) >a )<k|, 
./o 

S 2*(C) := I / /?(ui(s,£),£s,s;fc,a)ds| 
./o 



./?(X,«) :=f?{x,s)-fi{x), 

f?(6, es, s; k, a) := / 2 B (0, es, s; fc, a) - (0, es; fc) 



(3.142) 



(3.143) 



We will also need B^, B^ defined by 



B^(0 := sup Bf(s) , (3.144) 

»6[0,0 



where i = 1,2. 



We refer to B^, Bl? as "Besjes terms" and their importance will be seen 
both in the bounds presented in Theorem [2] and in the proof of the the- 
orem where they eliminate the need for a near identity transformation. 
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With this setup we can now state the NtoR approximation theorem. 

Theorem 2. (Averaging theorem in NtoR case: v = k + ea, < e < Eq, 
fcGN, \a\ < 0.5; 

With the setup given by items 1-9 of the above preamble we obtain, for ( G 
I(e,T), that 

\9{(,e) - Vl (C,e)\ = 0(e) , \ X {(,e) - v 2 ((,e)\ = 0(e) . 
More precisely 



1(9(0 - «i(C,e)| < e[[B^(T/e) + C*T) cosh(Ty LfL^) 

+ [^00 (T/e) + C'fT]J^sinh(T V / LfLf)) , (3.145) 



|X(0 - v 2 (C,e)\ < e [B«(T/e) + C«T\ J -| sinh(T J L*L* 



R 
J _2_ 



+ [B^(T/e) + CfT] cosMTy/ZfL*)) . (3.146) 

Moreover 

B&oCT/e) < Bf-(T) , (3.147) 
w/iere z = 1, 2 end £f (T) G [0, 00) are independent of e and defined by 

IK 2 1 / 

B\(T) := -— [2| AP X . | + -] f Xoo(^o, Xo, fc, a) 

+K 2 T|jj(^,AP x0 )A , (3.148) 



B 2 (T) := (2 + T [|a| + 2A; Xoo (fl , X o, fc, a)] 

x y M^m^. (3.149) 

^ \n-k\ ' 

nSZ\{fc} 1 1 

Furthermore, there exists an < e$ < 1 stzc/i thatforO < e < Eq, (0((, e), x((, £)) 
stays away from the boundary of the rectangle Wr(0 o , Xch di, ^2) for ( G 
I(e,T). Thus the ODE continuation theorem (see \2~^ Section 1-2]) gives 
(3(e) > T/e, hence I(e,T) = [0,T/e]. 

The proof of Theorem [2] is presented in §4.21 
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3.5.3 Remarks on the averaging theorems 



(1) We have now explored the 9, x dynamics as a function of v in the A-NR 
case and v = k + ea in the NtoR case. However asymptotically there 
are gaps for v e [k + ea, k + A) when e is small. For A = 0(e) the 
NR normal form breaks down because the error is 0(1), however we 
can come close to the NtoR neighborhood by letting A = 0(e 13 ) with 
/3 near 1 however the error in the NR normal form does deteriorate to 
0(e 1 ^ 13 ). It could be interesting to explore the dynamics in these gaps. 

(2) Important for the functioning of the FEL is knowledge of the fraction 
of the bunch that occupies a bucket. From the analysis in §3.4.31 this 
occurs for ICs in the libration case, i.e., < Sp en (Z ) < 2 where Z is 
given in (I3.66p . One can thus determine the set of (9o,Xo) f° r which 
Z occupies the pendulum buckets. For more details on the pendulum 
motion and its impact on the low gain theory see §3.71 

(3) Mathematically we want to make sure the buckets are covered by our 
domain W(eq) x M for physically reasonable xo- From (I3.7ip the range 
of the w 2 -values in the buckets for the NtoR normal form is the interval 
(~ f + + N° w a > —1/2 so, for every k, the smallest v 2 in a 
bucket is — ^ — jr whence, since k > 1, the very smallest v 2 in a bucket 
is —Q — 1/4. Thus requiring 



entails that Xb is smaller than any %-value inside the buckets and 
smaller than any %-value on the separatrix. It is plausible to restrict 
the physically interesting x-values to be greater than, say 3xb- The 
condition that (9, 3xb) £ W(eo) entails that the buckets are covered by 
W(eo) and that eo satisfies the constraint 3xb > Xibi^o)- The following 
proposition is a corollary to Propositions [Tf2l 

Proposition 3. Let < e < e where < e < 1 and v 6 [1/2, oo). 
Let also A7 be a positive constant and let 




(3.150) 




-1 



(3.151) 



//^effi satisfies the condition: 



1 < 7 C - A7 < 7c(l + ex) < 7c + A 7 , 



(3.152) 
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then 



X > Xw(£o) • (3.153) 

In other words if Eq satisfies A3.151\) then the 7 values in [7 C — A7,7 C + 
A7] are covered by W(eq). 

The proposition guarantees, by choosing a sufficiently small Eq, that 
the domain W{eq) x M is large enough to contain the physical relevant 
values of 9, x, C- 

Proof of Proposition \B Let x G K satisfy (13.1520 . Then, by f TOj) . 
X e h^A T , ^A 7 ] whence, by ®JQ , , dSIMD , 

1 1 



<-± + ^^W<-^<x, 

which entails (I3.153p . □ 

Note that the condition: 1 < 7 C — A7 in (13.1520 is not used in the proof 
of Proposition [3] but serves to guarantee that x satisfies the physical 
condition: 7 > 1, i.e., 1 < j c (l + ex)- 

(4) In applications of Theorems [T|2| T should be chosen so that z G 
[0, T/sk u ] is the domain of interest, e.g., so that T/(ek u ) is the length 
of the undulator. 

(5) In many discussions of this nature, researchers often just assert the 
existence of bounds, for example by using the well known fact that a 
continuous function on a compact set is bounded, or bounds are ob- 
tained which are crude. Here we wanted to do more. By using, in the 
proofs of Theorems [1] and [21 a system of differential inequalities instead 
of the Gronwall inequality we have been able to use two Lipschitz con- 
stants in each proof instead of their maximum and in a similar manner 
can treat the two Besjes' terms independently as well as the compo- 
nents of g and g R . Furthermore, we believe the Besjes bounds and the 
bounds on gi,g2, gi, g2 are nearly optimal. 
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We also note that there are only 3 restrictions on the size of Eq and 
thus e. The first is that we require Sq < 1. But this is only a matter 
of convenience and is really no restriction at all since the averaging 
theorems are only useful for e small. The second restriction is in item 
5 of the preambles to the two theorems, however as indicated there 
this is not a significant restriction. Thus the only real restriction is 
keeping the solution away from the boundary of W, Wr in order to 
obtain I(e,T) = [0, T/e]. This is an optimization problem; by making 
W, Wr larger, e can be larger, however this is compensated to some 
extent in the Lipschitz constants as well as the bounds on gi,g 2 , <?f , 9 2 
which would become larger. Nonetheless, the situation is quite good in 
comparison to say KAM or Nekhoroshev theorems (see e.g., [5]), where 
the restrictions on e are quite severe and it is with great effort that the 
restrictions on e have been improved in some applications, e.g., solar 
system problems. 

(6) We here clarify the contributions of jj to the error bounds of Theorems 
[T]and|2]by finding simple upper bounds for B 21 (T), B±(T), B 22 {T) and 
B?(T). First of all we note from (EJ23J and flSZSD that 

\fj(n;u,AP x0 )\ < 1 + |AP, | , (3.154) 

where v > 1/2. Clearly f !3.154p gives upper bounds for B 2 i(T), B^(T) 
in (I3.130p .( f3.148p . Secondly, we obtain from the Cauchy-Schwarz in- 
equality that 

/ \ 1/2 / 1 \ 1/2 




(3.155) 



where the finiteness of the rhs follows from the fact that the function 
v, AP x0 ) is of class C°°. Since v, AP x0 ) is also 27r-periodic we 
can apply Parseval's theorem to get 
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(3.156) 

It also follows from ( I3.23P that 
d 



d ^jj(C^, AP x0 ) = -exp(-zz/[T sinC + Tisin2C]) ( sinC 
+w(cos C + AP x0 ) [T cos C + 2Ti cos 2(] 



whence 

|ij(C; i/, AP x0 )| 2 < 1 + ^ 2 (1 + |AP,o|) 2 [|T | + 2T X ] 2 , 
so that, by (133551) .f l3TT56ll . 

1/2 



V |S(n; i/, AP x0 )| <^=(l + v\\ + | AP a . |) 2 [|T | + TtA 
o^nez V 3 V / 

(3.157) 

which entails, by (I3.154p . 

\fj(n; v, AP x0 )\ < 1 + (AP^ol + ^ \fj(n; u, AP x0 )\ 

n£(Z\{k,k+l}) O^neZ 

1/2' 



(3.158) 



< 1 + |AP x0 | + -j= ( 1 + u\l + |AP X . |) 2 [|T | + 2Tu 



Clearly (13. 158j) gives an upper bound for P 22 (T) in (I3.13ip . Moreover, 
by fl3~T54j) . (133571) . 

£ li3( T;-tr 0) ' - l^(Q;fe ; A^. )|+ £ I^,ap, )| 

neZ\{ifc} O^neZ 

1/2 



< 1 + |AP x0 | + -j= [I + v\\ + lAPa-ol) [|T | + 2Ti] 

which gives an upper bound for B^iT) in ( I3.149p . 
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3.6 Approximation for the phase space variables in 
d231)-([235]) 

Here we discuss the approximate solutions of f l2.22p - fl2.25p and (I2.29P in terms 
of the normal form approximations given in (13 . 36p . (13 . 70 p , ( 13 . 7Tj) , namely 

{2xot + do NR case 

^X(Qr; Z ) - sgn(K (k))n /2 + ar^j /k NtoR case , 

(3.159) 

and 

Xo NR case 

Xnf(t) := { _ , , \ „ + _ (3.160) 



(W(fir; Z ) + aj /2Jfe NtoR 



case 



where Kq is given in f)3.60p and Q in (I3.69P . Recall from Theorems Q] and [2] 
that 

f(C,e) = Me0 + O(e), (3.161) 
x(C,e)=XNF(e() + 0(e), (3.162) 

for C G /(e,T). From ^,§33}, (J23T]) , 

*(C, e) = J| (C - ^ct(CAj) + 0(C) , (3-163) 

and from ( 12. 36ft 

7(CA«)=7c(l + ex(C > e))- (3-164) 

Now we can determine the approximate solution of (I2.22p -f l2.25p and 
f)2.29p . From ( I3.16ip . fl3.163l) the arrival time, t(z), of a particle at z is given 
by 

t(z) = Z -- ^jj- (e NF (ek u z) - Q(k u z) + 0(e)j . (3.165) 
Furthermore from (11.51) . f l3~162p . ( 13. 164|) the energy in (I2.29P is given by 

7 (z) = VS(^ + X N F (ek u z) + 0(e)) , (3.166) 
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and is clearly slowly varying. From fl2.37p .( l3~T]) . fl3.2p we have 

p x (z) = mcK[cos(k u z) + AP x0 + 0(e 2 )} . (3.167) 
It is tedious but straightforward to derive from ffT3|) . (1237]) . (I3~T|) . ( l3T2|) . f l3~T66l) 



V^Q + Xnf(sKz) + 0(e) \ . (3.168) 



p z {z) = mc 

Finally we can now determine x(z). From f l2.22p . fl3.167p and (I3.168P 

d p x (z) 
-x(z) 



dz p z (z) 

mcK[cos(k u z) + AP x0 + 0(e 2 )} J / ^rncVS Q + Xnf(eKz) + 0(e) ^ 

g (X/yg)[cos(M + AP g0 + Q(e 2 )] 
1 + exnf{£Kz) + 0(e 2 ) 

(cos{k u z) + AP x0 + 0(e 2 )) ( 1 - £X;vf(£M + 0{e 2 ] 



= -=[cos{k u z) + AP x0 ][l - £XiVF(eA; u z)] + 0(e 3 ) . (3.169) 

Integrating ( 13. 169j) gives 
x(z) = x(0) 

eK ( sm(k u z) . „ /" z r . . „ , , , . . \ 

+ — I — h zAP x0 - e J [cos{k u s) + AP xQ \xNF{ek u s)ds\ 

+0(e 3 z) . (3.170) 

For e sufficiently small, I{e,T) = [0,T/e] and then fl3TTB5]) -f l3TTBg|) and 
f l3~T70l hold for < k u z < T/e. 

3.7 Low Gain Calculation in the NtoR regime 

Low gain theories in [21 El H] are done in the context of the pendulum equa- 
tions, i.e., f l3.58p . fl3.59p with a = 0, AP x0 = 0, and k = 1. Here we will not 
make those assumptions and we define the gain by 



G(C, e) := e(v 2 (C, e) - X o)e = ^K, 1) - Xo)e , (3.171) 
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where v 2 is given in f !3.7ip and ( ) g denotes the average over 9q. This is 
consistent with [21 El S] - 

The gain G could be calculated numerically using a quadrature formula 
and an ODE solver, however standard treatments calculate it perturbatively 
using a regular (and thus short time) perturbation expansion. We could do a 
regular perturbation expansion in f)3.58p .f l3"39~j) by letting Vi = Ylt=o £kj ^ik + 
0(e 5 ) and using Grownwall techniques to make the 0(e 5 ) error rigorous (see 
[25l p. 594] for an example of a regular perturbation theorem at first order 
and its proof). However at the fourth order needed here this would be quite 
cumbersome. Because of the special scaling structure in (I3.58p . fl3.59p as given 
in f !3.6ip we can use a Taylor expansion. For e = 1 we get from f l3.58p .f l5.59p 

v' 1 (;l) = 2v 2 (;l) , V 1 (0,l) = 9 , 

<4(-> !) = -K Q {k) cos(Awi(-, 1) - ar) , v 2 (0, 1) = % , 

(3.172) 

and we expand v 2 (-,l) about r = so that 

4 - .5 f i 



«2(t, 1) = XO + 



£ ^ fc) (0,l)r fc + ^ jf {l-tfvf\tTA)dt. (3.173) 



From (1G.6I) in Appendix [G] we have 

4(0,1) = —K (k) cos(^o) , 

^(0, 1) = K (k){2k Xo - a) sin(fc0 o ) , 



<(0, 1) = K Q (k) l-kK Q (k) sm(2k9 ) + [2kxo ~ a} 2 cos(k9 
v' 2 "'{0, 1) = ifo(A;) ^2A;K (A;)(2A;xo - a){sm 2 (k9 ) - 3cos 2 (^ )] 

-[2k X o - a] 3 sin(^ ) J . (3.174) 



It follows from fl3.173p . fl3.174p that the average over 9 leads to 



(v 2 (r, 1) - X o) eo = ¥ <'(0, 1), + 0(r 5 ) = _-/c^(/c)[2A; Xo - a] + 0(r 5 ) , 

(3.175) 
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which gives, by f )3.17ip . 



5M 



G((,e) = e(v 2 (e(, 1) - X o) 6o = —^kK£(k)[2k Xo - a] + 0(e b ) . (3.176) 

This shows the effect of a and k on the gain. 

We now compare our gain formula in (I3.176P with the corresponding 
calculation in [2], where a = 0, AP x q = 0, and k — 1. From our NtoR normal 
form system (I3.58p .f l3.59p and letting 9 = v\ and t] = ev 2 we obtain the IVP 

9' = 277 , 9(0) = 9 , (3.177) 
rf = -ecos9 , r/(0) = exo =■ Vo , (3.178) 

where e = e 2 K (l). The procedure in [2] is a regular perturbation expansion 
in e that does not assume that t] is small. Proceeding as they do, we write 

0(C, e) = 9°(() + eO\C) + ^ 2 (C) + 0(e 3 ) , (3.179) 
v(C, c) = ?AC) + ^(0 + eV(C) + 0(e 3 ) . (3.180) 

We find 



v°(0 


= Vo , 


(3.181) 


o°(0 


= 2VoC + $o , 


(3.182) 


v\0 


= —[sin 6» - sin(2?7oC + 8 )] , 

= — {Csin^o + — [cos(2?7oC + 8 ) - cos6» ]} , 
Vo 2 Vo 

1 f< 

= — / dt sm(2r] t + 9 ){t sin 9 
Vo Jo 


(3.183) 




(3.184) 


v 2 (C) 





^[cos(2r/ t + 6 ) - cosflo]} • (3.185) 
2?7o 



It follows that V 1 (()e = an d 

= T- [ (tcos2r] t - sm2 Vo t)dt . (3.186) 
^^o Jo -^O 

We can rewrite (13. 186j) as 

^ = T^O 2 ■ r:= '* c - (3 ' 187) 
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and the gain becomes 



G(C,e) = ey(C) 0o = e *K*(l)\<«±(^) 2 , (3.188) 

consistent with [2] . For 770 small, which is required by our averaging approx- 
imation (since 770 = exo and xo = 0(1)), we obtain from (I3.186P that 

W)e = ^ / V^oi 3 + O( V0 tr}dt » -^ C 4 . (3.189) 
It follows from ( 13TT88D . (13TT89D that 

G(C,e) » -e 2 ^ C 4 = -^i<- 2 (l) X o , (3.190) 
o o 

as in ( 13.176ft with a = and k = 1. 

Thus we see that (I3.176P is consistent with the standard gain formula 
for r = 770C small. The 0(e 6 ) error in ( I3.176P can be made precise by esti- 
mating the remainder term in ( I3.173p . However, we cannot justify the gain 
formula either in (13. 176p or in (13. 188ft in the context of our Lorentz system 
in (12.22ft - ( 12.25ft . because our NtoR normal form approximation only gives 
an approximation to 0(e). Thus a justification of the gain formulas, based 
on our Lorentz system, would need to come from elsewhere, e.g., a numerical 
calculation based on f 13 . 8 j) and (13. 9p . 

4 Proof of averaging theorems 

In §4.11 we prove the NR theorem, Theorem 1 of §3.5.11 and in §4.21 we prove 
the NtoR theorem, Theorem 2 of §3.5.21 

4.1 Proof of Theorem [I] (Averaging theorem in A-NR 
case) 

Here we compare solutions of the exact I VP (I3.10p .( l3.1ip : 

9' = ef 1 (xX) + e 2 gi(9, X ,C,e,v) , 9(0,e) = 9 , (4.1) 
X ' = ef 2 (9, C; v) + e 2 g 2 (9, X , C; e, v) , x(0, e) = Xo , (4.2) 
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where 

A(x ,0 = ^, (4.3) 

q 

C; v) = -K 2 (cos( + AP x0 ) cos(u[9 - Q(£)]) 

= -^-e^^jjin; u, AP x0 )e^-^ + cc , (4.4) 

with the normal form IVP of (EP4j) . ( ET55]) : 

«i = e/i(u 2 ) , vi(O,e) = 0„, (4-5) 

4 = e/2(vil ^) , ^(0, e) = Xo , (4-6) 

where 

/iW = 2^, / 2 («i;i/) = 0, (4.7) 

for v E [k + A, k + 1 - A]. 

Subtracting and integrating, we obtain from A3. 1220 . (Ojl .flOl.flOjl.lOjl 
that 

0(C,e) -vi(C,e) = e / [A(x(«, e), s) - /i(v 2 (s, e), s) 
Jo 

+A(t> 2 (s,£),s) - /i(u2(s,e)) +e^i(0(s,e),x(s,e),s;e,i/)] ds 



£ 



c 



[/i(x(s,e),«) - /i(v2(s,e),s) 



+/i(Xo,s) + e^i(0(s,e),x(s,e),s;e,i/)J <is , (4.8) 

and 

x(C,e) -«2(C,e) =e / [M^s, e), s; 1/) - / 2 (ui(s, e), s; 1/) 
Jo 

+/2(«i(s, e), s; 1/) + eg 2 (9(s, e), x(s, e), s; e, 1/)] rfs 

= e / [/ 2 (0(s,£),s;z/) - f 2 {v 1 {s,e),s;i>) 
Jo 

+f2(vi(s, e), s; v) + eg 2 {0{s, e), x{s, e), s; e, z/)] , (4.9) 

for £ G I(e,T) = [0,T/e] fl [0, /3(e)). Important for our analysis below is 
that the points (0(C, e), x( s ; £ )) an d ( u i( s ; £) ; w 2(s, e)) belong to the rectangle 
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W(9q, Xch di, d 2 ) for ( G I(e,T). Note that we have added and subtracted 
fi( v 2(s, e), s) in f )4.8p and f 2 {vi{s,e),s]v) in (14. 9p . an idea introduced by 
Besjes [15] (see also [13]). 

Taking absolute values, applying the Lipschitz condition on W(9 , xo, d\, d 2 ) 
and defining 

e 1 (s):=\e(s,e)-v 1 (s,e)\, (4.10) 
e 2 (s) := \ X (s,e) -v 2 (s,e)\ , (4.11) 

gives, by (!33l^ .( l33lT|) .f l3320|) . (l3~T2Tj) . fl3~T23| ) , gS]),g3D for C e IfoT), 

< ei(C) < [ C e 2 (s)ds + \ f fx(xo,s)ds\ 
Jo Jo 
C K 



/ |^(0(a,e),x(s,e),s;e,i/)|] <e[Li / e 2 (s)ds + ^(C) + TC : 



JO 



'i 



< £ [Lx [ C e 2 (s)ds + B hoo (T/s) + TC 1 ] =: Ri{() , (4.12) 

«/ o 

< e 2 (() < e[L 2 I e 1 (s)ds + \ I f 2 (2 Xo £s + 6 , s; u)ds\ 
Jo Jo 

+e [ \g 2 {9( S ,e), X {s,e),s;e,u)\]<e[L 2 [ e 1 {s)ds + B 2 {{) + TC 2 ) 
Jo Jo 

< e[L 2 [ ei (s)ds + B 2 ^(T/e) + TC 2 ] =: R 2 (() , (4.13) 
Jo 

where we also used that I(e, T) C [0, T/e] and where we have introduced the 
Ri as in the proof of the Gronwall inequality for a single integral inequality 
(the Gronwall inequality is discussed in many ODE books, see, e.g., [241, p. 36] 
and [261 P-310 and 317]). ( E I(e,T). 

Recall that L±, L 2 , C\, C 2 , B±, B 2 are defined in items 7,8 and 9 of the 
preamble to the theorem. For convenience we have suppressed the e depen- 
dence of t\ and e 2 . 

Before we proceed with the proof, several comments are in order. 

1. We refer to the terms -Bi(C), B 2 (C) in (13.1211) as Besjes terms since they 
were introduced by him in order to prove an averaging theorem without 
a near identity transformation; a simplification. Standard proofs use 
the near identity transformation (see e.g., [61 191 fit)]). 
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One may fear that the Besjes terms could grow as large as 0(1/ e) for 
C 6 [0,T/e], i.e., that B itOQ (T/e) = 0(1/ e). However this doesn't hap- 
pen here since, by ( 13 . 1 27j) . B\, B 2 (T, A) are upper bounds for B ij00 (T/e) 
and are e independent. Two facts are mainly responsible for this: (a) 
the fact that for fixed v\ and v 2 the integrands have zero mean, i.e., the 
quantities in f)3.122p have zero mean in s, and (b) the fact that Vx(s,e) 
and V2(s,e) are slowly varying. 

2. We maintain the system form in (I4.12p . fl4.13p . We could add these two 
inequalities and obtain an error estimate using a Gronwall inequality. 
That is, let L ro = max(L 1 ,L 2 ), B^ = B hoo + B 2)00 , = C 1 + C 2 , 
then adding gives 

< 6^(0 < e[Loc / e 00 (s)ds + BooiT/e) + C^T] , (4.14) 
Jo 

where = e\ + e 2 . The Gronwall inequality gives 

eoo(C) — e [Boo(T/s) + CooT] exp(eL OQ Q. However our system approach 

gives better bounds. 

3. We have a draft of a general paper on quasiperiodic averaging which 
uses the Besjes idea and deals with the small divisor problem (See [33]). 
However the proof we are presenting here is simple, the small divisor 
problem is trivial and the error bounds are quite explicit. Thus we feel 
it is good to give complete proofs here rather than appealing to a more 
general theory. Also it serves the pedagogical purpose of showing how 
an averaging theorem is proved in a simple context; here the context of 
f l3~T0|) . (13~TT|) and fl3T48|) . ( JS3SD - We have incorporated the Besjes idea 
in much of our previous averaging work, see [131 12HJ I27J |2U [29] . 

We now proceed with the proof. It follows from fl4.12p . fl4.13p that 

R[ = eLMO < eL x R 2 (C) , ^(0) = e[B 1)0O (T/e) + C\T\ , (4.15) 
R' 2 = eL 2ei (C) < sLzR^O , ,R 2 (0) = e[B 2)00 (T/e) + C 2 T] , (4.16) 

whence, by Appendix [J for ( G I(e,T), 

Ri(0 < ewi(eC) , R 2 (0 < ew 2 (e() , (4.17) 

where 

w[ = L lW2 , Wl (0) = B lj00 (T/e) + C X T , (4.18) 
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w' 2 = L 2 w x , w 2 (0) = B 2tOB (T/e) + C 2 T . (4.19) 

Note that in Appendix [I] we use the fact that R±, R 2 are of class C 1 . 
Solving (!%m(Ogj) we find 

wi(s) 
w 2 (s) r 

' Li 



cosh(sy/L7L 2 ) yj^i smh(sVLjT 2 ) \ / B 1iO0 (T/e) + C 1 T 
^sinh(sVCTD cosh( y s^/LT^) J \ B 2>oc (T / e) + C 2 T 

(4.20) 

whence, by ([112]) ,(03]), ([137]), 

ei(C) < < ewi(T) = e f [B 1>00 (T/e) + dT] cosh(T^Lj7 2 ) 

+[B 2>00 (T/£) + C 2 T] sinh(rV^)) , (4.21) 
e 2 (C) < ew 2 (eC) < ew 2 (T) = e([B hoo {T/e) + C^yj^w^iTy/Lj^) 

+[B 2>00 (T/e) + C 2 T] cosh^v 7 ^)) , (4.22) 

for ( e I(e,T), where, at the second inequalities, we have used the fact that 
Wi and w 2 are increasing (the latter follows from (I4.18p .( j4.19p . (l4.20p ). We 
thus have proven (I3.125I) . (I3.126I) in Theorem 1. 

We note that B\ and B 2 ^(T) are finite. Also, since the Fourier series of 
v, AP x0 ) is absolutely convergent, we conclude from (I3.13ip that B 22 (T) 
is finite whence, by (I3.129p . B 2 (T,A) is finite. 

By restricting e , and thus e in (]4.2ip . (l4.22p . we can keep e), x(C? e )) 
away from the boundary of W(9o, Xo, di, d 2 ) for ( e I(e, T). In this case T/e 
must be less than (3(e) thus I(e,T) = [0,T/e). 

To complete the proof we have to show (13.1271) which is the heart of the 
proof. Thus we have to estimate B\,B 2 . From (12 . 47p , ( 1HU6"|) , (13 . 1 2"2"j) we obtain 

q( s \ — q 2K 2 1 

fi{v 2 {s, e), s) = 2 = v 2 (s, e) = —[2AP x0 cos s + - cos(2s)]xo , 

q q 2 
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and thus, by (I3~T2T|) . ( 13~T2gj) . 

2K 2 , ft r _ . „ 



2K 2 P 

Si(C) = — | / [2AP aK) coss + -cos(23)]xods| 
Q Jo * 



= = 2AP x0 smC + -sm(2C) < = {2\AP x0 \ + -) 

q 4 q 4 

= B , (4.23) 
so that, by (1332511 . B ljOB (T/e) < B v From ( l3~36l) . ( |3~T22l .(TOjl we obtain 

/afafoe),*;!/) = -^e^ 2e ^]^i}-(n;z/,AP x0 )e^-^ + cc, 
whence, by (ETISIjl and for (6f, 

5 2 (C) = ?l / e^ 2e *° s+ ^ Vfj(n; z/, AP^Je^'da + cc| 

K 2 /" c 
= — | Y\jj(n\ v, AP x0 ) / e f[2^o«+flo] e t(n-i/) Sd5 + cc | 

<K 2 J2 \Tj{n] v, AP x0 )\ | f j^x*s e i{n-v)s ds \ ^ ( 4 24 ) 

where in the second equality we used the fact that the Fourier series of 
jj(-;u,AP x0 ) is uniformly convergent. Integrating by parts gives, for < 
C<T/e, ' 



e i2ev XO s e i(n-v)s ds \ 





< 2 + 2 g z/| X o|C < 2 + 2(fc + l)| X o|P 
In — i/| In — i/| 



whence, by fl424j) . for < ( < T/e, 



i(n — v) 



P 2 (C) < 2/i 2 [l + (jfc + l)| Xo |T] Y \ 33{n]U ' APxo) \ . (4.25) 
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The n — v in the denominator is the so-called small divisor problem in this 
context. It is easily resolved in this A-NR case. In fact, for v A-NR, i.e., 
k + A<i/<k + l — A, we have 



E 



jj(n; v, AP x0 ) \jj(k; v, AP x0 ) 



n — v 
\fj(k + l;v, AP x0 ) 



\k + l-u\ 
\?j(h+l;v, AP x0 ) 



\k — v\ 

E 

ne(Z\{fc,fc+l}) 



\jj(n; u,AP x0 )\ \Jj(k; u, AP x0 ) 



\n — u 



A 



ne(Z\{fc,fc+l}) 

whence, by fl3TT29l) . fl3TT3^ . fl3TT3T]) .f l05|) . 

\fj(k; u, AP x0 )\ + \jj(k + 1; u, AP x0 ) 



B 2 (()<2K 2 {l + (k + l)\xo\T}{- 



A 



+ Yl \jj(n;u,AP x0 )\} = ^B 21 (T) + B 22 (T) = B 2 (T, A) 



n6(Z\{fc,fc+l}) 



so that, by fl3~T23|) . B 2)OQ (T/e) < B 2 (T, A). 
This completes the proof. 



(4.26) 



4.2 Proof of Theorem [2] (Averaging theorem in NtoR 
case where v = k + ea) 

The proof goes analogously to the proof of Theorem [T] in §4.11 and so we omit 
some details. 

Thus we begin by comparing solutions of the exact IVP (I3.48p . fl3.49p 

0' = ef*( X , + e 2 <7?(0, X, C, e, a) , 5(0, e) = 0„ , (4-27) 
rt = efX(6,eC 1 C;k 1 a) + e?gX(6,x,t,e,k,a), x (0,e) = X o, (4.28) 

where, by , dSSDD , , 

A H (X,C) = ^, (4-29) 
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f R (6, e£ C; k, a) = ~ ex V (t[k6 - ae(}) ^jj(n; k, AP x0 y^ + cc 



2 

ngZ 



(4.30) 



with the normal form I VP of fl33H|) . f l339|) 

v[=ef*(v 2 ), v 1 (0,e) = 9 , (4.31) 

v' 2 = ef R {v u e(; k) , v 2 (0, e) = Xo , (4.32) 

where 

/i> 2 ) = 2v 2 , (4.33) 

f 2 R {v x ,e(] k) = - — exp(i[fcui - aeC])jj(^ fc, AP^) + cc . (4.34) 

Subtracting and integrating, we obtain from 03.1430 . fl4T271) . fl42^ . fl4311 . fl4T3^ 
that 

(9(C) -t^(C,e) = e f [f R ( X (s),s)-f R (v 2 (s,e),s) 
Jo 

+fi(v 2 (s, e),s)- f R (v 2 (s, e)) + eg R (6(s), x(s), s, e, k, a)] ds 

" C [fiix(s), s) - f?{v 2 (s, e), s) 
+f?(v 2 ( 8 > £ )i s )+ £9?(8{s), X(s), s, e, k, a)] ds , 

(4.35) 

and 

X{0 -«2(C.e) = £ / [f 2 {0(s),es,s;k,a)- f 2 (v 1 {s,e),es,s;k,a) 
Jo 

+/-f OiO; £ ), £s , s ; k, a) - f 2 (vx(s, e), es; k) + eg 2 (6(s), x(s), s, e, k, a)] ds 

= e I [f2(0(s),es,s;k,a) - f 2 (v 1 (s,e),es,s;k,a) 
Jo 

+f2 R ( v i( s i £ ), £s , s ; k, a) + eg R (9(s), x(s), s, e, k, a)] ds , 

(4.36) 

for ( G I(e,T) = [0,T/e] PI [0, /3(e)). Taking absolute values, applying the 
Lipschitz condition and defining 

ei (s) := \9(s)-v 1 (s,s)\ , (4.37) 
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e 2 (s) := \ X (s)-v 2 (s,e)\ , (4.38) 
gives, by fETPTHli . fITT7T~TTTT) . flHTTTTI) . (|5^ . fl5^ .(05)l r (05jl for C e 7(e,T), 



< ei(C) < e[Lf / e 2 (s)ds + \ f*(v 2 (s,e),s)ds\ 
Jo Jo 

+e f \g?(9(s),x(s),s,e,k,a)\ds]<e[L* [ e 2 (s)ds + B*(() + TC*} 
Jo Jo 

< e[Lf f e 2 (s)ds + B^T/e) + TCf ] , (4.39) 

0<e 2 (()<e[L% e 1 {s)ds + \ I /*(vi(s, e), es } s; k, a)ds\ 
Jo Jo 

+e [ \g 2 H (9(s), X (s),s,e,k,a)\ds}<e[L^ [ ei (s)ds + B*(() + TC?} 
Jo Jo 

< e[L* f ei (s)ds + Bj^CT/e) + TC*] , (4.40) 

Jo 

where we also used that I{e,T) C [0, T/e]. Recall that Lf ,Cf ,Bf are 
defined in items 7,8 and 9 of the preamble to the theorem. 

We are now in the same situation as in the proof of Theorem [I] since 
replacing L h C h Bi in (HISj) . (Q5]) by Lf,Cf,5? results in flCTD . fOOj) . 
Since, as shown in the proof of Theorem [TJ (02]l . (051> entail ( OTj) . fl4T22l 
we thus conclude here that fl4.39p . fl4.40l) entail: 



ei(C) < ef [^(T/e) + C X T] cosh^y^Lf Lf) 

+[Bj 0O (T/e) + C 2 T] J^sinh^yZfLf)^ , (4.41) 

e 2 (C) < ^[^(T/e) + C X T] j|| sinh(Ty / LfLf) 

+[Bj 0O (T/ £ ) + C 2 T] cosh(T V / LW)) , (4-42) 

for C G 7(e,T). We thus have proven fl3Tl^ .f l3TT46|) . 

Clearly, by (J3TT1B]) . Bf(T) is finite. Also, since jj(-;u, AP x0 ) is a C°° 
function, the series on the rhs of ( 13. 149[) converges whence B 2 (T) is also 
finite. 
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By restricting e , and thus e in (I4.4ip . (l4.42p . we can keep (0((, e), x(C, e)) 
away from the boundary of W(9 , Xo, d±, d 2 ) for ( G I(e, T). In this case T/e 
must be less than (3(e) thus I(e,T) = [0, T/e]. 

To complete the proof we have to show (I3.147p . Thus we have to estimate 
Pf , P 2 R and beginning with Pf we conclude from fl2T47l) . fl31l3|) .f l439|) . f l4T33|) 
that, for (Gl, 

2i^ 2 1 
= — \2AP x0 coss + - cos(2s) Ws, e) , 
q 2 

whence, by (1X821) . flXT4^1) .f l37[48l) . ( 1CT]) .( Q4) for < C < T/e, 



2K 2 



q 



[2AP x0 cos s + - cos(2s)]i> 2 (s, e) cfe 



2K 2 1 

— [2AP.Q sin C + - sin(2C)MC, e) 

— / [2APZ0 sin s + - sin(2s)] — -(s, e)cfe 
io 4 « s 



2A' 2 



[2AP x0 sinC + -sin(2C)]^ 2 (C,^ 



+eK 2 fj(k;k,AP c 



x0) 



C 1 
[2 APj-o sin s + - sin(2s)] cos ( kv i (s, e) — eas ) 



< 



2K 2 



[2|AP, | + -]h(C,£) 



+eif 2 |jj(^^A^.o)|[2|AP :!; o| + -]C 



2K 2 
< [ 2 |AP,o| 



< 



Q 

2K 2 



v 2 ((,e)\ + K 2 e(\jj(k-k,AP xC 



[2\AP x0 \ + -\ (xoc(9o,Xo,k,a) 



+K 2 T\fj(k;k,AP x0 )\^ = Pf(T) 



(4.43) 



so that, by flBTT^ . P 1 fi oc (T/e) < Pf (T) which proves fl3TT47j) for z = 1. The 
key step here is the integration by parts at the second equality which makes 
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explicit the slowly varying nature of v 2 by pulling out the explicit e after the 
third equality. 

To prove fIBTWl) for % = 2 we conclude from f[3TT43D . (TOO]) . that, 
for ( G R, 



fg(v i(s, e), es, s; fc, a) = -^-e 



whence, by fl3TTi2l for ( G R, 



i[kvi (s,e)—eas] 



33{ n \k,AP x0 )e 



i(n—k)s 



CC 



n£Z\{k} 



K 2 



D i[kvi(s,e)—eas] 



]T fj(n;k,AP x0 )e^ s ds + cc 



nSZ\{fc} 



<K 2 \jj(n;k,AP x0 )\ I ^ e i[kvi(s,e)-eas) e i(n-k)s d ^ ? (444) 

neZ\{ft} ^° 

where in the inequality we used the fact that the Fourier series of k, AP x0 ) 
is uniformly convergent. Integrating by parts gives, by ( I3.82p . (I4.3ip . (l4.33p 

for < C < T/e, 



D i[kvi{s,e)— eas] i{n— ft); 



ds 



ds\ 



i(n — k) 



e i[fc«i(C,e)-eaC] e i(n-fc)C _ e «ft6>o 



< 



< 



< 



< 











/o 




1 




\n — 


k\ 


1 




\n — 


k\ 


1 




\n — 


k\ 


1 





2+ f (k\^(s,e)\+e\a\)ds 
Jq ds 

2 + 5 f (2k\v 2 (s,e)\ + \a\)ds 
Jo 



2 + e( 
2 + T 



\a\ + 2kxoo(Oo,Xo,k,a) 
a\ + 2kXoc(QoiXo,k,a) 



\n — k\ 

whence, by flBTM . fTCTD for < C < T/e 



B*(C) <K 2 [2 + T [|a| + 2k X oo(e , X o, k, a)] 



(37 



x E 




AP x0 )\ 



(4.45) 



n — k 



n£Z\{k} 



so that, by fl3TT44D . B^T/e) < B${T). This completes the proof. 

5 Summary and future work 

We started with the 6D Lorentz equations for a planar undulator in fl2.7p . fl2.16p - 
(I2.18P with time as the independent variable. In §2.21 we introduced z as the 
independent variable and considered the I VP at z = with y = p y0 = 0. 
Solutions of this system are completely determined by the solutions of our 
basic 2D system (I2.33p . fl2.34p for a and 7. This basic 2D system is the 
starting point for the rest of the paper and the first step is to transform 
it into a form for first-order averaging; the subject of §2.31 We introduce 
C = k u z as the new independent variable, and x as a new dependent variable 
by 7 = 7c (1 + ex). Here we are thinking of electrons as part of an electron 
bunch with 7 C as a characteristic value of 7 and e as a measure of the en- 
ergy spread so that \ is an 0(1) variable. We thus arrive at the system for 
{Oaux, X) given in fl2.4ip . fl2.42p and we are interested, in this FEL application, 
in an asymptotic analysis for e and l/j c small. Expanding the vector field 
for (12~H1) . (|2"12|) gives (12301) . (12311) . Here 9 aux is not slowly varying and we 
thus introduce the generalized ponderomotive phase, 9, in (12.521) which leads 
to the slowly varying form of (I2.56p . (12.571) . Most importantly, we discover 
that in order for 9 and x t° interact at first order we must have e = 0(l/j c ) 
and without loss of generality we take (11.51) as a result of (I2.58j) . Finally we 
obtain (I2.62p . (12.631) which is in a standard form for the MoA. Consequently 
this will lead to a pendulum type behavior which is central to the operation 
of an FEL. 

The MoA can be applied to fl2.62l) . (l2.63l) after an appropriate h is defined 
and the rest of the paper, in Sections [3l4l focuses on the monochromatic case 



Before continuing with the summary we note that in the collective case 
there is a continuous range of frequencies and so it is natural to ask, "what 
happens in the noncollective case considered in this paper if there is a con- 
tinuous range of frequencies?". Here h can be modeled as in (12.781) . i.e., 



of d235D. 




(5.1) 



OS 



In the nonsmooth monochromatic case h(£) = [<5(£ — v) + S(£ + v)]/2 and 
(15.11) gives h(a) = cos(V«) as in the monochromatic case of f|2.15p . and, as 
we have discussed in ^|3l there are resonances for integer v. However we have 
found that in the smooth case the average of (cos£ + AP x0 )h(9 — Q(()) is 
zero and so the averaging normal form for (I2.62p . fl2.63p is just the NR normal 
form of §3.31 Thus a smooth localized near the v = 1 monochromatic 
resonance, washes out the effect of that resonance in the first-order averaging 
normal form. This does not mean that there is no resonant behavior near 
v = 1 because it may not be possible to prove an averaging theorem. We 
are pursuing this. Furthermore even if an averaging theorem can be proven 
there might still be an effect in second-order averaging. 

In §0 we begin by determining the 0(e 2 ) terms of ( I2.62p . fl2.63p using 
( 1277211 . ( 127751) . Th us we obtain f[37T0|) - f[37T75|) as our basic system for 9,x- 
Proposition [1] gives a domain, W(eq) x R, on which gi,g2 are well defined as 
well as their limits as e — > 0+. In particular the vector field in ( 13 . 1 1) . ( T37TT]) 
is well defined on W(e ) x R. 

Eq.'s fl3.10p . fl3. lip are in a standard form for the MoA and for each v 
the normal form is obtained by dropping the 0(e 2 ) terms and averaging 
/ij/2 over (. However the average of f2 is not clear from (I3.13P and it is 
convenient to expand it in a Fourier series which is given in fl3.26p -f l3.28p . 
The average is then easily obtained in f|3.30p and leads to the definition 
of NR, A-NR, resonant and NtoR v. The NR normal form equations are 
9' = e2\ and x' = and the resonant normal form equations are given 
by (I3.3ip . The NR case is stated precisely in §3.31 Instead of focusing on 
the resonant case of f|3.3ip we consider in §3.41 the more general NtoR case 
where we study the dynamics in neighborhoods of the v — k resonances. 
If the neighborhood is too small then the resonant normal form of f !3.3ip 
will be dominant thus the natural neighborhood to study with first-order 
averaging is 0(e) and this is the content of §3.41 Replacing v by k + ea, 
our basic equations fl3.10p . fl3.lip are rewritten in fl3.42p . fl3.43p . The function 
fi in (I3.43P has two e dependencies one of which contributes to the 0(e 2 ) 
term and we are led to the basic NtoR system fl3.48p - ()3.52p . Proposition [2] 
is analogous to Proposition [T] by giving us the domain W(eq) xKon which 
gi,g% are well behaved as well as their limits as e — > 0+. In particular 
the vector field in (157351) . ( 157451) is well defined on W(e ) x E. In 3374721 the 
NtoR normal form is presented in fl3.58p . fl3.59p . The solution structure is 
conveniently illuminated, in terms of the simple pendulum system, in §3.4.31 
The simple pendulum exhibits four types of behavior and these are exploited 
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to discuss the structure of solutions of (j3.58p . (l3.59p in these four cases. 

At this stage we have normal forms for v G [k + A, k + 1 — A] and 
v = k + ea. However there may be gaps between the dynamics covered by 
the A-NR normal form and that of the NtoR normal form. So it is comforting 
to note that there is a link between the two dynamical behaviors in that the 
NtoR normal form is approximated by the NR normal form far away from 
the pendulum buckets as discussed in §3.4.41 

In §3.51 we state the two averaging theorems which relate the A-NR and 
NtoR normal form approximations to the corresponding exact systems. Each 
theorem has a detailed preamble which sets up a compact statement of the 
theorem. The theorems establish the main results of the paper. Namely that 
the normal form solutions give an 0(e) approximation to the exact solutions 
on long time, 0(l/e), intervals. In the A-NR case, the v interval can be 
made larger by making A smaller but this is at the expense of increasing the 
error as discussed in Remark (1) of §3.5.3. 

The results of the theorems are applied in §3.6[ where the normal form 
approximations are used to derive the approximate solutions of the Lorentz 
equations with z as the independent variable. In §3.71 we discuss the small 
gain theory for v = k + ea based on our NtoR normal form and compare it 
with the standard theory for k = 1, a = 0. We do point out however, that we 
have not justified the low gain theory in the context of our NtoR averaging 
theorem as we mention at the end of §3.7. 

Finally the proofs are given in §|H It can be seen that the proofs them- 
selves are quite simple. The proofs are somewhat novel in that they do not 
use a near identity transformation, due to the Besjes approach, and they use 
a system of differential inequalities in the calculation of the error bounds, 
rather than a Gronwall type inequality, which leads to better error bounds. 
Therefore a solution of the system of differential inequalities is presented and 
verified in Appendix [H The first theorem, which is stated for the A-NR case, 
is an example of a quasiperiodic averaging theorem with its concomitant 
small divisor problem. It's inherently interesting in that the small divisor 
problem arises in what must be the simplest possible way. We develop the 
general theory of quasiperiodic averaging in [2]. The second theorem, which 
is stated for the NtoR case, is an example of periodic averaging which has 
a vast literature, however as mentioned above our approach here is novel. 
While the proofs of Theorems 1 and 2 are simple the whole application of 
the Mo A is not. There was considerable work to put the problem into the 
standard form and considerable effort to calculate the bounds on gi,gz in 
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Appendix O and g±, g^ in Appendix |E] as well as their e = limits in Ap- 
pendixes [B] and [D] 

We now comment on future work. First of all it would be interesting to 
include the y dynamics using (12.12)) as we do, but not assuming the zero 
initial conditions in y, thus treating the full 3D dynamics. 

Secondly, it would be interesting to study the helical undulator as we 
have done here for the planar undulator, i.e., via first-order averaging. 

Thirdly, the work here sets the stage for a second-order averaging study 
of the NR case in f l3~T0|) .f l3~TT|) using fl3T39|) . (EOOjl and the NtoR case in 
f l3.48p .( l3T49|) using (I3.56p .( l3~57]) . In both cases we have systems of the form 

^ = eF(U, t) + e 2 G(U, t) + 0(e 3 ) , (5.2) 
with approximating normal form given by 

^ = eF(V) + e 2 G(V) , (5.3) 

where F is the i-average of F and G is a linear combination of the t-average 
of G and terms depending on F (See [251 Section 5, p. 610] for a construction 
of the normal form, i.e., G, and an associated theorem and proof). Such 
a study would include a computation of the averages from (I3.39p . (l3.40p and 
(I3.56p . (l3.57p and then a phase plane analysis of this second order normal form 
system including a comparison with our first-order normal form system. In 
addition averaging theorems could be proven which we anticipate will give an 
0(e 2 ) error on [0,T/e] as in [25]. Furthermore, it would be interesting to see 
what happens in the NR case, e.g., is the energy deviation x still conserved. 
We note that generically second-order averaging gives a better error estimate 
but the interval of validity remains the same (See [25] for situations where 
the time interval can be extended). Finally it would be interesting to know 
if, in the NtoR case, there is a breakdown in the integrability of the NtoR 
normal form due to separatrix splitting, [30], with the concomitant chaotic 
behavior. This is a delicate issue, which cannot be studied with second-order 
averaging, since ( 15. 3 p is a second order autonomous system and as such it 
cannot exhibit chaos as pointed out at the end of §3.4.31 This work could 
be a possible future project, however it does not appear to be interesting 
from the application point of view since collective effects are surely more 
important than noncollective effects at second order. 
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Fourthly, we are therefore eager to move on to the collective case based 
in part on our understanding here. As a first step we are studying the 
consequence of flH.lj) - (IH.6l) . We have not seen this form of the solution of the 
ID wave equation in the FEL literature although the first equality in flfi.3|) 
is derived in many elementary PDE books. In addition, we are pursuing the 
issue raised in the paragraph containing Eq. (15. ip . concerning a smooth h. 
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(ll.4|> 
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K 
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K 


(|3.60|) 
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NR (nonresonant) 
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N 
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fl2.37P 
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fl2.47p.fl2.48p.fl2.53p 
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fl3.l9p.fl3.H3p.fl3.133p 


Z 
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7c 
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A 


Definition [U(^3~2l) 


A — NR (A — nonresonant) 


Definition [T](gOP 


AP x0 


(I2.45P 


e 


(11.50 


c 


(I2.39P 


V 


fl2.36P 
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O2.40p.fl2.52p 




fl3.2p.fl3.3p.fl3.17p.fCM5) 


T ,Ti 


fl2.54p 


X,Xib(e) 


O2.36p.(l3.20p 


n 


03.69P 
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Appendix 



A The Bessel expansion 

Here we derive the Bessel expansion (I3.27P of v, AP x0 ). In fact by (I3.23P 
AP X . ) = (cosC + APa-o) exp(-zz/T sinC) exp(-iuT 1 sm2() 

= \jji{0 + \n-i{Q + AP,ojjo(C) , (A.i) 

where 

JJm(C) := exp(imC)exp(-w[T sinC + Tisin2C]) . (A. 2) 

Now 

exp(ixsin6 l ) = J n (x) exp(in9) , J- n (x) = (—l) n J n (x) , (A. 3) 
whence, by ( 1A.2I) . 

= J] J I (i/To)J fc (i/T 1 )e^ m - , - a *)« 

= ^-n^fc^To)^^!)^^ . (A.4) 

Let 

^(n, m, i/, T , Ti) := y~] J m _ n - 2fc (z/T ) J fc (i/Ti) , (A. 5) 

then, by (1A.4|I . 

iUO = E «7(n, m, !/, T , TOe^ , (A.6) 

and thus, by ( 1A.1I) . 

jj(C, u, AP x0 ) = (lj(n, 1, i/, T , Ti) + l -J(n, -l,u, T , T x ) 
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+AP x0 J(n,0,u,T ,T 1 )y in < , (A.7) 

whence, by (I3.25p . 

jj(n; is, AP x0 ) = -J(n, 1, v, T , Ti) + ^Ji^, -1, T , Ti) 

+AP rf( 7(n,0,i/ I T Ji) ) (A.8) 

so that indeed (I3.27P holds. 

It is useful for the discussion after Definition [TJ to have the following 
special case. We have, by (1A.8|) . 



fj(k; k, 0) = l -[J{k, 1, k, 0, Ti) + J(k, -1, fc, 0, Tx)] , (A.9) 



where 



«7(fc, 1, fc, 0, TO = Ji- k -2k>(0)J k >(kTi 



fc'ez 

J(i-k)/2{kTi) if fc odd 
if A; even 



(A.10) 



J(k, -1, A;, 0,Ti) = ^ J_ 1 _ fc _ 2fc /(0)J fc /(A;T 1 ) 

= J ^-(i+fc)/2(A:Ti) if A; odd - A n 

^ if fc even . 

Thus from flA.9h jj(k; k, 0) = for even and, for = 2n + 1 with nGZ, 

5(2n + 1; 2n + 1, 0) = ^[J- re ((2n + 1)T0 + J- (n+ i)((2n + 1)T0] 
= ^(-l) n [4((2n + 1)10 " ^n+i((2n + 1)T 1 )] . (A.12) 

B Limit of g\, g<i 

Let ee(0, e ] with e G (0, 1] , let z/ G [1/2, oo) and let (9, x, C) e W"(e ) x K- 
In this appendix we will prove the properties (IB.5j) . (lB.8p . flB.12p . flB.13P of g± 



and g 2 . The properties (IB . 8 1) . f TBTT3|) are used in the proof of Proposition [TJ 
Furthermore the properties (IB.5p .f lB.12p will be used in Appendix O Since 
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all assumptions of this appendix are also satisfied in Appendix [Bj we can 
apply the results of Appendix [B] 

We first consider g x . Note that, by (pTlT]) . (ET2 |) . 



l + K 2 U 2 x (9X,e,u) = q(C) 
e 2 K 2 q 



2v 



s\n{v[9 - Q(C)]) - sin(^ ) 2(cosC + AP : 



+ ^(sin(^-g(C)])-sin(^ ))) • 



(B.l) 



We obtain from ( ETT3|) that 



9 2£ 



e 2 q n z (9,x,C,e,v) Q 



whence 
1 



1 



£ qU z (U z + l)eV = n 2 - 1 + ^gn z (n, + l)e 2 (l - 2e X ) 

(1 + £x)2 y-j{q + eV) + ^<?n z (n z + l)e 2 (l + e X ) 2 (l - lex) 



(B.2) 



where we used from (l3.3j) . (lB.lj) the fact that 



U 2 z (9,xX,e,u)-l 



S(l + e X ) 2 



g(C)+e 2 Ki(^,C^^) , (B.3) 



with 



K 2 q 



sin(i/[0 - Q(C)]) - sin(z/fl ) 2(cosC + AP 3 



+ 0(sin(z/[0-Q(C)])-sin(^ o )) ) . 



(B.4) 



Clearly, by flR2j) . flB~3]) . 
1 



2£ 



gn 2 (n, + i)eV 



e 2 q 



£(l + ex¥ 



i--n,(n a + i)(i-3 £ V-2£ 3 x 3 : 



£ 4 Kl 



f(l + ex) 5 
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l(U z - 1)(U Z + 2) + V(II Z + l)(3eY + 2£ 3 X 3 ; 



£(l+e X ) 2 

whence 

-Lqn z (n z + i) 2 e 4 gi 

e 2 q 



2£(l + e X ) 2 



(II 2 - l)(n, + 2) + e 2 Tl z (Tl z + 1) 2 (3 X 2 + 2e X 3 ) 



4 2/2 

(g + e 2 Ki)(II, + 2) 



£ fil £ ? ( e f 2 



£(l + e X ) 2 2£(l+e X ) 4 V^ 
+e 2 n,(n 2 + 1) 2 (3 X 2 + 2e X 3 )(l + e X ) 2 



A 

e Ki 



£(l + e X ) 2 



( £ \{Tl z + 2) + e 2 Yi z {Ii z + 1) 2 (3 X 2 + 2e X 3 )(l + ^) 2 



2£(1 + 5 X ) 4 V^ 
e 6 q(U z + 2)k! e 4 k x 



2S 2 (l + e X ) 4 S(l + e X ) 2 

2£{fhxf (j q{Uz + 2)+ £2Uz{Uz + 1)2(3X2 + 2£X " ){1 + £X)2 

4 / 2 



2(l + £ X ) 2 + -g(II, + 2) 



28(1 + exY\ A/ S 

so that 

gn,(n 2 + i)V 



' q (U z + 2) + U z (Tl z + 1) 2 (3 X 2 + 2e X 3 )(l + e X ) 2 



(l+ex) 4 \£ 



2(l + e X ) 2 + ^(n 2 + 2) , 



(1+^) 4 V ^ 
i.e., 

^ *• C; £ ' "> ^ - ^wr^ (!^ + 2 ) 

+n z (n, + i) 2 (3 X 2 + 2£ X 3 )(i + ^) 2 
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Clearly, by (I33j) . (lB~4j) . 

lim \n z (9,x,£,e,v)] = l, (B.6) 

lim [ Kl { X , C, e, i/)] = — (sm(u[9 - Q{()\) - sm(v9 )) (cos ( + AP x0 ) , 

(B.7) 



whence, by (1B.5j) 



hm [^(e, x,C;^)] = (|g(C) + i2* 2 ) 

/ sin(z/[0 - Q(C)]) - sin(^ )l (cos C + AP x0 ) . (B.8) 



.tO 



2v 

We now consider #2 and we obtain from (I3.15P that 

e 2 g 2 {9, X , C; e, i>) = ^ 2 cos(i/[0 - Q(C)]) f cos C + AP 

1 n a (g,c,e^) 

1 + ex H z (6,x,C,e,v) 

whence 

n 2 (l + ex)eg 2 = K 2 cos(u[9 - Q(()}) ((1 + £X )n,(cosC + AP x0 ) - n a 

= K 2 cos{v[9 - Q(C)]) ((cosC + AP E0 )[(1 + e X )n, - 1] - £ 2 k 2 ) ,(B.9) 

where we used from (13. 2 p the fact that 

U x (9,(,e,u) = cos( + AP x0 + e 2 K 2 (9,(,v) , (B.10) 



with 



« 2 (0, C, 1/) := |;Mi/[0 - Q(C)D - sin(z/0 o )] . (B.ll) 



Clearly, by (lR9|) . 

n,(l + e X )eg 2 = K 2 cos(z/[# - Q(()}) I (cosC + AP x0 )[U z - 1 + e X U z ] - e 2 n 2 



whence, by (1B.3|) . 

(U z + l)U z (l+e X )eg 2 

= K 2 cos{v[6 - Q(C)}) ( (cosC + AP x0 )[U 2 z - 1 + e X U z (U z + 1)] 



-e 2 k 2 (U z + 1) 

= K 2 cos(^ - Q(C)]) ( (cosC + AP x0 )[-—— -(q + e 2 ^) 



e 1 



S(l+e X ) 2 

+e X H z {U z + l)]-6 2 n 2 {U z + l) 



so that 

n z (n, + i)(i + £ X ) 3 ^ 2 



K 2 cos(z/[fl - Q(()}) ((cos C + AP xQ ) i~j(q + e 2 k x ) 
+e X n,(n z + 1)(1 + exf] - e 2 n 2 (U z + !)(!+ e X )^j 



which entails that 

n z (n, + i)(i + e X ) 3 22 



K 2 cos{u[6 - Q(C)]) (^(cos C + AP x0 ) [~(q + £ 2 «i) 
+ X n 2 (n 2 + 1)(1 + exf] - sk 2 (U z + !){! + exf 



i.e., 



^ 2 cos(^-Q(C)]) / / A , AD u e,,,. , , 



x, c; s, ,) = , M „ ; , H , ^(-s c + AP, ) [--(,(o + s^o 

+ x n z (n, + 1)(1 + ex) 2 ] - sk 2 (U z + 1)(1 + ex) 2 ) • (B.12) 
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Clearly, by flR6|) . (|RT2|) . 



lini lg 2 (6,x,(;e,v)]=xK 2 cos(vi9-Q(()])(cos( + AP x0 ). (B.13) 



C Bounds on ^1,^2 



Let e E (0,£ ] with e G (0, 1], let v e [1/2, oo) and let (0o,Xo) e W(e )- Let 
also 



Xz&(£o) < -Xm 



(C.l) 



where xm is the positive constant from Theorem [T] (see item 2 of the setup 
list for Theorem [1]). We also assume that 



X,C) e M x (xo -d 2 ,Xo + d 2 ) x 



where 



< d 2 < Xo ~ Xib(£o) 
Note that, by fl3TT^ . f[3^ .f lU^ . flC3|) . 



(C.2) 



(C.3) 



X,C)e Rx(xo-4Xo + 4)xf C ( W(e ) x R ) cP(e,z/) . (C.4) 



In this appendix we will prove the properties ( 1C.27I) .( IC.30I) of g\ and g 2 . 
We thus show in this appendix that the properties ( 1C.27f) .( IC.30l) hold in the 
situation of Theorem [T] (see item 8 of the setup of Theorem [T]). Moreover the 
properties (lC.27j) . (1C.30j) will be used in Appendix |E] 
We first consider g\ and we obtain from (IB. 51) 



0i 



qU z (U z + mi+exY\£ 
+U z (U z + l) 2 (3x 2 + 2e X 3 )(l + e X ) 2 



(n 2 + 2) 



g n,(n 2 + i )2 (i + ^V 2(1 + ^ )2 + ¥^ + 2 ) 



(C.5) 



It follows from f lCTj) . f l2^8|) . fl3T6|) . (|57)» . (|dj» that 

g > , g > , l + ex>0, < II 2 < 1 



SO 



3x 2 + 2e X 3 = X 2 + 2x 2 (l+£x)>0, 

(C.6) 



whence, by (IC.5j) . 



+n z (n 2 + i) 2 (3 X 2 + 2e X 3 )(i + ^) 2 



\Ki\ f n/ _ i 2 i e*q i 



+ q u zi n^mi + e X r l 2(1 + £X) + ^ + 2) 

« ( «(n. + 2) l3x2 + 2£x 3 



g(i + 5 X ) 2 V^n 2 (n, + 1)2(1 + e X )2 
i N ( gMn^+2) - 

g-IL,(n, + 1)^(1 + e X ) 2 V + £(l + e X ) 2 1 



Note also that, by fl3~T6|) . 



n2 , fl > x , ^ 2 i + ^ 2 n 2 (#,c,^) 



S (l + ex) 



2 



e 2 l + K 2 n 2 .( e ) 

a 1 - j ( i +e ff ' (c ' 8) 

Moreover £ 2 /(l + ex) 2 and 1 + K 2 Ii 2 x ub (e, v) are increasing w.r.t. e whence, 
by (ESD, 

n ^X,C, e ^)>l-j (1 + £qx)2 ■ (C9) 
Since < e < e we have, by (1C2[) . 

1 + ex > 1 + ^(xo - d 2 ) > 1 + inf (e(xo - <fe)) = 1 + mzn(0, e (xo ~ d 2 )) 

£ e(0,e ] 

=: k 3 (xo,^o,4) • (C.10) 
Note that, by (l3~20l) . ( !Cl3l . 

1 + ^o(xo - 4) > 1 + £oX«b(^o) > , (C.ll) 
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whence, by (1(XT0]) . 

K 3 {xo,eo,d 2 ) > , (C.12) 
so that, for n G N and by flC.101) . 



(l + ex) n ^s(Xo,£o,4) 
It follows from f l09|) . (ICl3|) . 

n^, x ,C,e J i/)>n ZlI6 (e ), (C14) 

where 

n a ,i 6 (e) := 1 - £ L TV • ( C - 15 ) 

£Kf 3 (xo,e,d 2 ) 

To show that ^(£r ) > we compute, by using (I3.20p . 

+ K 2 Ul ub (e ) = n +£oXlb (e ) \ 2 

If Xo < then, by flUTTOj) . flUTT]) . 

«3(Xo, £o, 4) = 1 + £o(Xo - d 2 ) > 1 + £oXi6(£o) > , (C.17) 

whence 



< 1 ± £oX ' 6( ;°j < 1 , (C.18) 



so that, by fjC16j) . 



E M^W < t . (c , 9) 

If xo > then, by dSUDD , dHU} , ddHID , 

k 3 (Xo, £o, d 2 ) = 1 > 1 - £oXm > 1 + £oXib(£o) > , (C.20) 

whence again (1C.18|) holds which entails ( IC.19j) by ( IC.16j) . Having thus 
proven (1C.19I) we conclude from (1C 1 5[) that 

ftz,i6(eo) > , (C.21) 
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whence, by (jC^jl ■ (IHlijl . 



where 



n ZiI6 (e) := Vn z , I6 (e) 



1-e 



Of course since IT Z , H z jb > we conclude from (IC.22j) that 



(C.22) 



(C.23) 



1 



< 



1 



n z (0,x,c,£,^) n e)I6 (e ) 

Inserting <KJI\i . (IUl3|l . (1U^4|) into (JUT]) yields to 

3g 



(C.24) 



^(Xo, £o, d 2 ) \SU Ztlb (e )Kl{xo, £0, d 2 



+ 3 X 2 + 2 £o |x| ; 



gn^ i/fe (e )fi:|(xo, £o, d 2 ) \ £ «l(Xo, ^0, <fe] 
Furthermore, by flCTj) . flRij) . (jTT2|) .( Rl6|) . 



(C.25) 



Ixl = |X - Xo + Xol < \x ~ Xo\ + IXol < d 2 + |xo| , 

C, e, 1/) I < — ( 2 + 2| AP x0 \ + —) < — ( 2 + 2|AP.o| + ^ 



g(C)<i + ^ 2 (i + |AP,o|) 2 =:g^. 



(C.26) 



Inserting fl026|) into fl025|) yields to 

q u b 



\gi(P,x,C;e,v)\ < 



x 



3<2W> 



uH Zt i b (e )Kl{xo,£ ,d 2 ) 
=: d{xQ,£o,v,d 2 ) . 



3^0 Qub 



£re|(Xo,£o,d 2 ) 

(C.27) 



S3 



We now consider g 2 and we obtain from (lB.12p . (lC.6p 

K 2 



\92\ < 



n.(n. + i)(i + ex )»V (1 + |AP - ,)[ ? (ff + ^ M 

+ | X |n z (n z + 1)(1 + e X ) 2 } + s \k 2 \(T1 z + 1)(1 + e X )' 



K 2 



e (l + \AP x0 \) 
SIl z (U z + l)(l+ex) 3 

£o|«2| 



(g + eo|«i| 



IxlO+JAggj) 

1 + 



n,(i + ex) 

Note that, by flBTT]) . (1061) . 



(C.28) 



M0,C,^)l < 



'7 



(C.29) 



Inserting KJM . <KJl3h . KJSty . (IU^6|l . (jC25)) into f!U^8|) yields to 

1^(0, x>C;e>f)l 

e (l + |AP a0 |) 



< K 



£U Z)lb (e )K 3 3 (xo, e , d 2 ) 

(rf 2 + |xol)(l + |AP 3! o|) | 

«3(Xo,£o,^2) 
C 2 (xo,e Q ,v,d 2 ) , 



2^Vo 1 nIA d I 1 £ 0<?- 



9u6 + ^— ^(2 + 2|AP x0 | + ^; 
i> v 



vTl z j b (eo)K 3 (xo,£o,d2 



(C.30) 



where K 3 ,U z>lb ,q ub are given by flOTOl . (1023]) . (1026]) . With fl027j) . (IO30|) we 
have shown that gi(-, v) and ^("j z/ ) are bounded for u > 1/2 for the points 



X, C, <0 e K x (xo - 4, Xo + 4)xRx (0, e ] . 



(C.31) 



D Limit of , 

Let e G (0,e ] with e E (0,1] and k E N, a G [-1/2,1/2] and let (0,X,C) e 
W(e ) xR. In this appendix we will prove the properties (1D.ip . (|DT2]) . (jD.3|) . f]D.5[) . 
(ID.7p . flD.lip of and g 2 . The properties (ID.2|) .f lDTTTj) are used in the proof 
of Proposition [2J Furthermore the properties flD.ip .f TD~3~]) .f lD.5p . (ID. T[) will be 
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used in Appendix [El Since all assumptions of this appendix are also satisfied 
in Appendix [Bj we can apply the results of Appendix [Bj 
We first consider g\ and we obtain from (I3.5ip . (IB.5j) that 



9i X, C, £ > k, a ) = 9i(9, X, C',£,k + ea) 



^(n z + 2) 



qU z (U z + 1) 2 (1+ e X ) 4 \S 
+ U z (U z + l) 2 (3 X 2 + 2ex 3 )(l + e X ) 2 



qU z (U z + 1) 2 (1 +exY 



2(i + ex) 2 + ^(n z + 2) 



(D.l) 



where Ii z = U z (9, x, C> £ y k + ea) and Ki = K\(9, (, e, k + ea) whence, by 



lira [gf {9, x, C; e, fc, a)] = lim [c/i (0, x, C; £, *0] 

e— s>0+ e— s>0+ 



^(t?(C) + 12X 2 ) 



if 2 / 
2^ V 5 



4g v £ 

sin(A;[0 - Q(C)]) - sin(fc0 o )J (cosC + AP x0 ) . 

We now consider g,f and we conclude from (I3.55p that 

9* {0, X, C, e, k, a) = g^(9, X , e, fc, a) + sjj^ X, Cl £ > ^ a ) 

where 

X, C; e, fc, a) := 02(0, X, & + ea) , 
92,2(0 iX,()£,k, a) := — — (cosC + AP^o) ( cos(k 4 + k 5 ) - cos(k 4 ) 



(D.2) 



(D.3) 



(D.4) 



K 2 



K 



cosC + AP x0 ) ( cos(k4)[cos(k5) — 1] — sin(K4) sin(ft5) 



— (cos ( + AP^o) ( — 2 cos(k4) sin (k 5 /2) 



—2 cos(ft5/2) sin(/t 5 /2) sin(ft4) 
2K 2 



cos ( + APco) sin(K 5 /2) I cos(/C4) sm(ft 5 /2) + cos(k 5 /2) sin(/C4) 
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(D.5) 

with 

1^4(6, C, e, k, a) := — £ _ T sin £ — Ti sin 2£) — ea( , 
K5(^, C, e, a) := £0(6* — T sin ( — Ti sin 2£) . 

(D.6) 

We obtain from flB12|l . flD^|l 

0?i(0> X, C; e, fc, a) = 5-2(6*, x, C; e, & + ea) 

cos^ - Q(C)]) f(cosC + AP x0 )[~(q + s 2 k x ) 



U Z (U Z + + e X ) 3 v L """"V /L ^ 

+ X n z (IL J + 1)(1 + ex) 2 ] - sk 2 (U z + + ex) 2 ] , (D.7) 



where IT^ = n z (0, x, C, £ , & + £a ) an d K 2 = ft2(6\C,^ + £ a ) whence, by 

(EH]), (EH, 

lim [#Ji(0,X,C;£,M)] = Urn [# 2 (0,x,C;£,k)] 

= X K 2 cos(k[9 - Q(C)])(cosC + AP x0 ) • (D.8) 

Clearly, by (TLT6]) . 

sin(K 5 (0,C,£,a)/2) a . 
hm J = -(0 - T sin C - Ti sin 2g , 

lim [« 5 (0,C,e,a)] = , 



lim [k4,(9, C, £, k, a)] = k{6 — ( — T sin ( — T 1 sin 2() 

e— »0+ 



(D.9) 



whence, by flD.5j) . 

lim \g* 2 (9, x, C; e, k, a)} = K 2 a(6 - T sin C - T 1 sin 2() 

xsin(A;[0-C-TosinC-T 1 sin2C])(cosC + AP X . ) , (D.10) 
so that, by flE^flEgj) , 

hm \g*(9, x, C, e, K a)] = X K 2 cos(k[9 - Q(C)])(cos C + AP x0 ) 

e— >0+ 

+K 2 a(9 - T sin ( - Ti sin 2£) 

xsin(A;[0-C-TosinC-T 1 sin2C])(cosC + AP a; o) • (D.ll) 
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E Bounds on gf, g^ 

Let e G (0,e ] with e G (0,1] and let k G N,a G [-1/2,1/2]. Let also 
(#o,Xo) £ W{eq). Moreover let Xibi^o) satisfy the restriction ( 1C.1I) where 
Xm is the positive constant from Theorem [2] (see item 2 of the setup list for 
Theorem |2]). Furthermore we assume that 

(0, X, e (do " d u 9 + dj) x (xo - tfe, Xo + d 2 ) x R , (E.l) 

where xo, d±, d 2 satisfy 

< d x , < d 2 < xo ~ Xib(eo) ■ (E.2) 



In this appendix we will prove the properties ( TE.6j) . (lE.14j) of and g 2 . 



We thus show in this appendix that the properties (1E.6j) . (1E.14[) hold in the 
situation of Theorem [2] (see item 8 of the setup of Theorem [2]). Since all 
assumptions of this appendix are also satisfied in Appendix [C] and Appendix 
|D| we can apply the results of those appendices. 

We first consider and we obtain from (I3.5ip that 

|0?(0,X,C,e,M)l = \gi(0,X,C,£,k + ea)\ , (E.3) 

whence, by ( l(X27j) . 

\9i(d,xX,£,k,a)\ < C 1 {xo,£o,k + ea,d 2 ) , (E.4) 

where C\ is given by (IC27[) . Note that, by (IC27j) . Ca(xo, £ o, v i d 2 ) is decreas- 
ing w.r.t. v whence 

Ci(xo, e , k + ea, d 2 ) < Ci(xo, £o, V 2 , d 2 ) =: C* (xo, £o, d 2 ) , (E.5) 
so that, by {Eg) , 

| #f (0, x, C, e, M) I < Cf (xo, e , 4) , (E.6) 

where C/ 2 is given by (1E.5|) . 

We now consider g% and we obtain from flD.3|) that 

\g*(6,x,(,e,k,a)\ < \g^(6, X , C, e, a)\ + \g? fi (6, X , C; e, a) I • (E.7) 
Note that, by (1030]) . flLTij) . 



l#Ji(0,X,C;£,M)l = |^ 2 (0,x,C;^,^ + ^a)| < C 2 (xo,^o,^ + ^o,4) , (E.8) 



where C 2 is given by (IC.30p . Note that, by (lC.30p . C 2 (xo; £ o, d 2 ) is decreas- 
ing w.r.t. v whence 

C 2 (xo, Eo, k + ea, d 2 ) < C 2 (xo, £o, 1/2, d 2 ) -■ C^(xo, £o, d 2 ) , (E.9) 
so that, by (lK8|) . 

l^fiC^X, C, e, A;, a)| < C^( Xo , £ , d 2 ) , (E.10) 
where is given by flE.9[) . We also have, by (1D.5[) . 

\g* 2 (8,X,C,£,k,a)\ 
2K 2 



< 



e 

4K 2 

e 



(cosC + AP x0 ) sin(ft 5 /2) ^cos(k 4 ) sin(K 5 /2) + cos(k 5 /2) sin(/€ 4 ) 
sin(«: 5 /2)|(l + |AP x .o|) . (E.ll) 



Of course, by (jOLljElj). 

= -|sin(y[0- ToSinC- Tisin2C])| 



< 



e 
\a\ 



9 — T sin ( — Ti sin 2Q 



< ^(Ifl + ITol + ITxl) 



< yW+di + lTol + ITxl), 



(E.12) 



whence, by (IE. lip . 

\g* 2 (9, X ,(;£,k,a)\ < 2K 2 \a\{\ + \ AP x0 \)(\9 \ + d x + |T | + 1^1) 

=:<7* 2 (0 o ,Mi). (E.13) 

We conclude from (jETT ]) . (jETTOjl . (lETT3|l that 

l02*(0,X,C>e>M)l < C 2A {xo,£o, d 2 ) + C£ 2 (0 O , a, di) 

=: (0 O , Xo, eo, a, di, d 2 ) , (E.14) 

where is given by (1E.9|) and is given by (IE. 13[) . 

With ()E.6p . (1E.14p we have shown that gf (•, k, a) and g 2 (-, k, a) are bounded 
for k EN, \a\ < 1/2 for the points 



X,C,£) e (0 o -di,0 o + di) x (Xo - d 2 , Xo + d 2 ) xMx (0,e ] .(E.15) 
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F Error bounds in a regular perturbation prob- 
lem 



Here we outline a derivation of error bounds in a regular perturbation prob- 
lem of relevance for §3.4.41 This could be made into a theorem and proof 
at the level of §3.51 and §@] but we leave this to the interested reader (see 
[25l §2] for a detailed discussion of regular perturbation theory relevant here, 
complete with a theorem and proof). We write the I VP in ( I3.109P as 

x[=x 2 , zi(0) = f, (F.l) 
x' 2 = — esinxi , x 2 (0) = 1 . (F-2) 

Then the zeroth-order approximation is 

u[ = u 2 , «i(0)=£, (F.3) 

u' 2 = 0, « 2 (0) = 1, (F.4) 

with solutions 

ui(s) = s + £, u 2 (s) = l. (F.5) 
Subtracting and integrating we obtain 

ei := - < / \x 2 (r) - u 2 (r)\dT , (F.6) 

e 2 := \x 2 (s) - u 2 (s)\ = e| / [sin(xi(r)) - sin(u 1 (r)) + sin(r + £)]dr\ 

Jo 

<e / \xi(t) - ui(r)\dr + e\ - cos(s + £) + cos(£)| 
</o 

< e [ |x 1 (r)-ui(r)|dr + 2e . (F.7) 

Introducing i?i and i?2 as in §U we have 

ex(s) < f e 2 (r)dr =: R^s) , (F.8) 
Jo 

e 2 (s)<e[ e 1 (r)dT + 2e=: R 2 {s) . (F.9) 
•/ o 
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Differentiating gives the differential inequalities 



Let 



R[ = e 2 <R 2 , R 1 (0) = 0, (F.10) 
R> 2 = eey < eR 1 , R 2 {0) = 2e . (F.ll) 



w' 1 =w 2 , w 1 (0) = 0, (F.12) 
w' 2 = e Wl , w 2 (0) = 2e. (F.13) 



Then 



Wl = y^2sinh(Ves) , (F.14) 
w 2 = e2cosh(v^s) . (F.15) 

Now as shown in Appendix HJ Ri(s) < Wi(s) and R 2 (s) < w 2 (s) whence if 
< s < T, 

ei(s) < \/e2 sinh(Ves) < \fe2 sinh(v^T) = 0(e) , (F.16) 
e 2 (s) < e2 cosh(v^s) < e2 cosh( v / eT) = 0(e) . (F.17) 

In the context of §3.4.41 with e = 1/Yq,C, = X Q , x\ = X,x 2 = Y we obtain 
from flFA6|). flRT7|) that X(s) = s + X + O(l/Y 2 ),Y(s) = 1 + O(l/F 2 ) 



whence X(t) = Y t + X + O(l/F 2 ), Y(t) = Y (l + O(l/Y )). 

G Derivatives for Low Gain Problem 

We here derive ( 1G.6I) which is needed in §3.71 By ( 13 . 1 T2I) we have 

v[(-, 1) = 2v 2 (; 1) , Vl (0, 1) = 9 , v' 2 (t, 1) = -K (k) cos(^ 1 (r, 1) - ar) 

= -^Ux P ( M (t)) + cc, 1^,(0, l) = Xo, (G.l) 
where 

u{r) :=i[Awi(r,l) - ar] . (G.2) 
It follows from flGll) that 
■^'(t, 1) = K Q (k)(kv [(t, 1) — a) sin(fcui(r, 1) — ar) 
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K (k) 

= K (k)(2kv 2 (T, 1) — a) sin(/c^ 1 (r, 1) — ar) = — exp(u(r))u'(r) + cc , 

4"(-> i) = ~ K °2 ex p( M )K' + (V) 2 ] + cc > 

^'"('j 1) = exp («)[«'" + 3m 'u" + (V) 3 ] + cc , 

(G.3) 

and from (jGlli.dCT. flG~3]) that 



«'(r) = zffcuiC-, 1) - a] = i[2kv 2 (-, 1) - a] , 

w"(t) = i2kv' 2 (r, 1) = —i2kK (k) cos(kvi(r, 1) — ar) , 

m"'(t) = i2^(r, 1) = i2kK (k)(2kv 2 (r, 1) - a) sin(^ 1 (r, 1) - ar) . 

(G.4) 

We conclude from (fG~H . (ITI21 . flCTIj) that 



u(0) = iAwi(0,l) = ik9 , 
it'(O) = i [2Ara 2 (0, 1) - a] = i[2fcxo - a] , 
u"(0) = -i2kK (k) cos(A;t; 1 (0, 1)) = -i2kK {k) cos(k9 ) , 
u"'(0) = i2kK (k)(2kv 2 (0, 1) - a) sin(Arai(0, 1)) 
= i2kK (k){2k X o - a) sin(fc0 o ) , 

(G.5) 

whence, by (ROl.dCT. 

^(0, 1) = -K (k) cos(Arai(0, 1)) = -K (k) cos(fc0 o ) , 

v£(0, 1) = #<,(*;) (2fcu2(0, 1) - a) sin(^i(0, 1)) = K (k)(2k X o - a) sin(fc0 o ) , 
<(0, 1) = -^^exp( M (0))K / (0) + (u'(0)) 2 } + cc 



_K^)_ exp ^^ Q ) ^-i2kK {k) cos(k9 ) - [2k Xo 



a] 2 I + cc 



-K Q (k) 2kK (k) sm(k6 ) cos(k6 ) - [2kx ~ a] 2 cos(k6 ) 



= K (k) \-kK (k) sm(2k6 ) + [2k X o - a} 2 cos(k6 ) j , 
4"'(0, 1) = -^^l e xp(n(0))[n" / (0) + 3u'(0)u"(0) + (u'(0)) 3 ] + cc 
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k ' exp{ik6 ) ( i2kK {k){2k X o - a) sin(fc0 o ) 



2 

+6A;A' (A;)[2A;xo - a] cos(fc0 o ) - i[2kx - a] 3 \ + cc 

+12/c_K" (^)[2/cxo — a] cos 
K (k) \2kK (k)(2kxo ~ a) sin 2 (kd ) 
—6kK (k)[2kxo — a ] cos (fc0 o ) - [Zkxo - a] 3 sin(fc0 o 



(G.6) 



H Calculation of E r /cB u in high gain regime 

In this appendix we aim to estimate the magnitude of the electric field. The 
basic field equation is 

(-^-o 2 -^)E x (z,t)^-cZ vac -^(z,t) , (H.l) 

where Z vac = 1 /ceo is the free space impedance and 

ecK v^, 1 

j(z,t) ■= -—^cos(k u z) V — — 6(z-z n (t)) 

S± ^ ln{t) 

ecKN 



cos(k u z)-J2 5 t z ~ z n(t)) , (H.2) 



7cS± v 'N 



n=l 



with Sj_ being the transverse emittance, see [2] and [31] . We proceed in two 
ways. In the first we solve (IH.ip and f]H.2|) directly and in the second we use 
Fourier tranforms. 

The unique solution of the homogeneous IVP at t = is 

E x (z,t) = ^ ds dy—(y,s) 

^ JO Jz-ct+cs as 
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■^[U-{z,t) + U + {z,t)], (H.3) 



where 



U-(z,t):= [ dy\j(y,t+-(y-z))-j(y,0)}, (H.4) 

J z-ct c 
rz+ct i 

U + (z,t):= / «(!/,t--(j/-«))-j(j/,0)]. (H.5) 

Jz C 

The first equality in (IH.3j) is often obtained using Duhamel's principle and 
d'Alembert's formula and the second equality is obtained after changing the 
order of integration. To obtain our estimate we consider z n {t) = (3 c ct + z n (0) 
which is quite crude (but may suffice for a rough estimate) and where the 
nonnegative j3 c is determined by (3% = (7^ — l)/j 2 . We obtain [32] U + <C U- 
and 

TV 

EL. CM) « - 2e °^ cN lj2 In(z,t)cos(2k ul 2 c [z-ct-z n (0)}) , (H.6) 



N 

n=l 



where 



In(z,t):={ I X Znit) < Z < Z n (0) + Ct 

v ' ' ' if otherwise . v ' 



So if all the particles contributed at z, which they don't, then U-(z,t) = 
0{ 2ec ^l cN ) and E rl = z ™^ c n wQuld bg ft typical value of the field ^ at 

(z,f). ^ 

We now give a second estimate, E r2 , of i? r . Following [21] which is based 
on [2] we Fourier transform flH.ip by defining 

1 f°° z s 

EJz,u) := — / dsE x (z, — ) exp(-zws) . (H.8) 

2tt c cfc r 

The Fourier inversion theorem gives 

/oo 
duE x (z, uj) exp(iuk r [z — ct]) . (H.9) 
-00 

We define j(z,u) in the same way as E x (z,u) whence, in the slowly varying 
approximation, (IH.lj) reduces to 

^(z,co) = -^](z,co), (H.10) 
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and from (1H.2|) we obtain 



i , \ ecKNk r . . 

j(z,w) = -— — — =-j(z,w), H.ll) 
2vr^ c 7 c E_L 



where 

1 N 

j(z,u>) := cos(/c n z) exp(— iujk T z) — exp(ia;cfc r T n (z)) . (H.12) 

71=1 

Here the function T n is the inverse of the function z n . To obtain our estimate 
we note that \j\ is bounded by 1 and replace it by 1 which is quite crude but 
may suffice for a rough estimate. Inserting this into (IH.lOp and integrating 
we obtain 

£, / \ n ( z vac ecKNk r 1 \ 

^ (Z ' W) = °^2^I^J' (H - 13) 

and, for /c M 2; = 0(1), 

4 = 0(E r2 ), ^ 2 :=^ e( 5 iV , ^ 2 7c- (H.14) 
We now have, recalling that K = 3.7 in LCLS, 

^1 = 4vr^ = 4ir/K r = 2tt(1 + — ) w 2tt(1 + (3.7) 2 /2) « 49 ,(H.15) 

and we calculate E r ijcB u . From ( 1H.14I) 

E r -2 = Z vac c eK k r N 

cB u An cB u k u il C £_l ' 
Now K/cB u = e/mc 2 k u and k r /k u ^ 2 = 2(1 + K 2 /2)~ 1 therefore 

E r2 _ Z vac c e 2 1 2 N_ = 1 2 

aB u 4tt mc 2 A; u (l + ir 2 /2) 7c S ± ^ (1 + A 2 /2) 7c £ ± ' 1 ' j 
where r e denotes the classical electron radius. Furthermore 

3c?77< 2 

r e « 2.82 ■ 10~ 15 m , — = , — — « 0.255 , 7 C = 10 4 

' k u 2tt ' (l + A' 2 /2) ' ,c 

and so 

— ^ « 0.034 ■ 10~ 12 m 2 — « 34 , — ^- = — « 34 • 49 « 1700 , 

cB u Ej_ c.D n cB u E r2 

for iV = 10 9 and £ i = 1mm 2 . 



94 



I IVP for a system of differential inequalities 

Here we present and verify a solution of the IVP for a system of differential 
inequalities which is used in §4.1[§4.2l and Appendix [Fj Consider the IVP for 

R[(C) < a x R 2 {Q , (1.1) 
R' 2 (C) < a 2J Ri(C) , (L2) 

where a\,a 2 > and Ri,R 2 are of class C 1 . We want to show, for £ > 0, 
that 

«,(C)<r,(C) : i? 2 (C)<r 2 (C), (L3) 

where 

r[ = ai r 2 , n(0) = i2i(0) , (1.4) 

r 2 = a 2 n , r 2 (0) = i? 2 (0) . (1.5) 

We do this in two ways. First we define fj(C) '■= Rj(C)~ r j(C) f° r 3 — 1,2, £ > 
whence, by (ITT]) . Q . flLij) . 

m, 

ri(C)<air 2 (C), ^(0 < ^(C) , ^(0) = f 2 (0) = . (1.6) 
Clearly we have to show that, for j = 1, 2, £ > 0, 

<V(C) < . (1.7) 

It follows from ( 1I.6[) that 



^i(C) < °i / dsr 2 (s) < a±a 2 / dsfi(s) 
Jo Jo 

r' 2 (C) <a>2 dsf[(s) < a x a 2 / dsf 2 (s) 
Jo Jo 



i.e., 



.2 



r'(C)<^ / dafi(a), (1.8) 
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where := y/a^- It follows from ( II. 8 P and by partial integration that 



exp(— aoC)fj'(C) + a o / ds exp(—a s)fj(s) = / dsexp(- 
</o Jo 

< al dsexp(-aos) / dsfj(s) 
Jo Jo 

K K 
= — a exp(— a £) / <isf j(s) + ao / dsexp(— aos)fj(s) , (1.9) 
Jo Jo 



which entails 



rj(C) < -a I dsr,{s) . (1.10) 



o 



Abbreviating 



f,(C) := / dsrjis) , (1.11) 







we obtain from (ll.lOp 

*}(C) = -^(C/ao) < - / dsfjis) = -f,(C) , (1.12) 
a o Jo 

whence 

> exp(C)[f,(C) + r;(C)] = [expCCtoCC)]' , (1-13) 
so that exp(C)fj(C) is decreasing w.r.t. ( which entails, by (II. lip , that 



= exp(0)f J (0)>exp(C)r J (C), (1.14) 



i.e., 



fjiC) < . (1.15) 
We conclude from n. fll^ . flLTol) that 

r'jiC) < 4 I dsf 3 (s) = rjiaoQ < , (1.16) 
Jo 

96 



whence fj(Q is decreasing w.r.t. C so that (II. 7N follows from ( 11.61) . 

The result in (II. 3[) is a special case of a much more general theorem on 
pages 112-113 of [26]. That proof simplifies in the special case here and we 
present it for the interested reader. The proof proceeds by cleverly introduc- 
ing a comparison function h. Here 

h(0 = (t(c)) :=a4exp(2a3 °(i) ' (L17) 

where 03 := max{a\, a 2 ), a 4 > 0. Then 

h[ = 2a 3 /ii = 2a 3 h 2 > a x h 2 , (1-18) 

h' 2 = 203/12 = 203/11 > a 2 h\ , (1-19) 

and we have, by fll.6j) . 

f\ - ciih <0<h[- a x h 2 , (1.20) 

r 2 - a 2 fi < < h' 2 - a 2 h x . (1.21) 

We now show that, for j — 1, 2, C > 0, 

f,(C) < fy(C) • (1-22) 

Suppose that (II. 22|) is wrong then there exists a smallest Co > such that an 
index j Q exists with 

r*(Co) = MCo) , (1-23) 
where we used that, by ( II.6j) .( II.17]) and for j = 1,2, 

^•(0) = < a 4 = hj(0) . (1.24) 
Clearly, for j = 1,2,0 <C< Co, 

f,(C) < fc,(C) ■ (1-25) 
Without loss of generality we take jo = 1 whence, for < C < Co; 

h(0 < h 2 (() . (1.26) 
It follows from ( \l.25\i that at the first intersection 

r[(Co) > K(Co) ■ (1.27) 

But by (|L2^ .( 1L26|) 

^i(Co) - ^i(Co) < ai(r 2 (Co) - h 2 (Co)) < , (1.28) 
which is a contradiction. 



97 



References 



[1] P. Baxevanis, R.D. Ruth, Z. Huang, "General method for analyzing 3-D 
effects in FEL Amplifiers", PRST-AB, 16, Of 0705 (2013). 

[2] K-J Kim, Z. Huang, R. Lindberg, "Introduction to the Physics of Free 
Electron Lasers", Lecture Notes for USPAS, Boston, June 2010, revised 
January 2012, unpublished. 

[3] P. Schmuser, M. Dohlus, J. Rossbach, "Ultraviolet and Soft X-Ray Free- 
Electron Lasers", Springer Tracts in Modern Physics 229, (Springer- 
Verlag, Berlin, 2008) 

[4] J.B. Murphy and C. Pellegrini, "Introduction to the Physics of the Free- 
Electron Laser", in Laser Handbook, Vol. 6, pp.9- 70, W.B. Colson, C. 
Pellegrini and R. Renieri eds., North- Holland, 1990. 

[5] E.L. Saldin, E.A. Schneidmiller and M.V. Yurkov, "The Physics of Free 
Electron Lasers", Springer- Verlag, Berlin, 2000. 

[6] J. Murdock, "Perturbations: Theory and Methods", Classics in Applied 
Mathematics 27, (SIAM, 1999) First published by Wiley, 1991. 

[7] K. Nozaki and Y.Oono, "Renormalization-group theoretical reduction", 
PRE, 63, (1993) 046101. 

[8] The literature on Hamiltonian perturbation theory (HPT) is vast. Basi- 
cally it involves canonical transformations to simplify the Hamiltonian 
in leading order of a perturbation parameter. The transformations are 
often defined in terms of Lie generating functions (e.g., Lie Series or 
transformations) which is more direct than the use of mixed generating 
functions (although not necessarily better). Two highlights of HPT are 
the statements and proofs of the Nekhoroshev and KAM theorems. The 
Nekhoroshev theorem can be viewed as the ultimate averaging theorem. 
An interesting discussion of these theorems, with a focus on the KAM 
case, can be found in the book "The KAM Story: A Friendly Introduc- 
tion to the History, Content, and Significance of Classical Kolmogorov- 
Arnold-Moser Theory", by H. Scott Dumas to be published by World 
Scientific (See also Chapter 7 of [9]). The Lie method is briefly dis- 
cussed in §3.11. A Hamiltonian, Lie transformation approach to a more 



98 



general version of the problem we consider in this paper is being pur- 
sued by R. R. Lindberg. His approach is influenced by the early work 
of Littlejohn on the so-called guiding-center motion. Lindberg's article 
is entitled "A derivation of the three-dimensional free-electron parti- 
cle equations based upon Lie transformation techniques" and a recent 
review of the guiding-center problem is presented in "Hamiltonian the- 
ory of guiding-center motion", J.R. Cary and A.B. Brizard, Reviews on 
Modern Physics, 81, April- June 2009. 

P. Lochak, C. Meunier, "Multiphase Averaging for Classical Systems: 
With Applications to Adiabatic Theorems" , Applied Mathematical Sci- 
ences 72 ( Springer- Verlag, New York 1988), Translated by H.S. Dumas. 

J. A. Sanders, F. Verhulst, J. Murdock, 'Averaging Methods in Nonlin- 
ear Dynamical Systems," Second Edition, Applied Mathematical Sci- 
ences 59 (Springer, New York 2007) 

J. Murdock (2006) Normal forms. Scholarpedia, 1(10):1902. 

J. A. Sanders (2006) Averaging. Scholarpedia, 1(11):1760. 

J. A. Ellison, A.W. Saenz, H.S. Dumas, Improved nth order averaging 
theory for periodic systems, J. Differential Equation, 84, 383 (1990). 

H.S. Dumas, J. A. Ellison, K. Heinemann, 'Averaging for Quasiperiodic 
Systems with Applications", in progress. 

B.J. Besjes, On the asymptotic methods for nonlinear differential equa- 
tions, J. Me'canique, 8, 357 (1969). 

James A. Clarke, "The Science and Technology of Undulators and 
Wigglers", Oxford Series on Synchrotron Radiation, Oxford University 
Press, 2004. 

Z. Huang, K-J Kim, "Review of x-ray free-electron laser theory", PRST- 
AB, 10, 034801 (2007) 

The well known phase plane portrait for the pendulum equation can be 
found in many places, e.g., see Fig. 6.4, p. 248 of [i~9] . 



F. Brauer, J. A. Nohel, "The Qualitative Theory of Ordinary Differential 
Equations: An Introduction", Dover Publications, 1989. 



99 



[20] The potential plane is simply a plot of the "potential" 1 — cos X versus 
X placed above the phase plane portrait. This is nicely illustrated in 
[15] . Figs. 6.5 and 6.6 on p. 249. See also Fig. 95, p. 142 of [22]. 

[21] See e.g., [T9] Section 6.2 for a derivation of the period as a function 
of amplitude as well a proof that the integral is well defined as an im- 
proper integral (note that the integrand is singular). The formula for 
the pendulum equation is given explicitly on p. 244. 

[22] V.I. Arnold, "Ordinary Differential Equations", 3rd edition translated 
by R. Cooke, Springer- Verlag, Berlin, 1991. 

[23] There are several good sources for the theory of second-order au- 
tonomous systems. Here we mention Chapters 15 and 16 in E.A. Cod- 
dington, N. Levinson, "The Theory of Ordinary Differential Equations" , 
McGraw-Hill, 1955, Chapters 4 and 5 in W. Hurewicz, "Lectures on Or- 
dinary Differential Equations", M.I.T. Press, 1958, Chapters 5 and 6 
in R.A. Struble, "Nonlinear Differential Equations" , McGraw-Hill, 1962 
and Chapter IV in G. Sansone and R. Conte, "Non-Linear Differential 
Equations" , The Macmillian Company, New York, 1964. 

[24] J. K. Hale, "Ordinary Differential Equations" (Krieger Publishing Com- 
pany, Malabar, Florida, 1980) 

[25] J. A. Ellison, H.J. Shih, The Method of Averaging in Beam Dynamics, 
invited paper in Accelerator Physics Lectures at the Superconducting 
Super Collider, AIP Conference Proceedings 326, edited by Y. Yan and 
M. Syphers (1995) 590-632. 

[26] W. Walter, "Ordinary Differential Equations," (Springer- Verlag, New 
York 1998). 

[27] H.S. Dumas, J. A. Ellison, A.W. Saenz, "Axial Channeling in Perfect 
Crystals, the Continuum Model, and the Method of Averaging" , Annals 
of Physics, 209 (1991) 97-123. 

[28] H.S. Dumas, J. A. Ellison, F. Golse, "A mathematical theory of planar 
particle channeling in crystals" , Physica D 146 (2000) 341-366. 



100 



[29] H.S. Dumas, J. A. Ellison, M. Vogt, First-Order Averaging Theorems 
for Maps With Applications to Accelerator Beam Dynamics, SIAM J. 
Applied Dynamical Systems, 3, 409 (2004). 



[30] The literature on separatrix splitting is large, our work is contained 
in J. A. Ellison, M. Kummer, A.W. Saenz, "Transcendentally small 
transversality in the rapidly forced pendulum", Journal of Dynamics 
and Differential Equations, 5, (1993) 241-277 and M. Kummer, J. A. 
Ellison, A.W. Saenz, "Exponentially small phenomena in the rapidly 
forced pendulum". In S. Tanveer, H. Segur, H. Levine (eds.), Asymp- 
totics Beyond All Orders, NATO ASI Series B: Physics 284, (1992) 
197-211. 

[31] R. Warnock, One-Dimensional FEL Equations, Notes, January 30, 2013. 

[32] J. A. Ellison, K. Heinemann, Unpublished Notes on collective ID FEL 
Theory, November 2012. 



101 



